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@) Vector. 



@ The invention relates to vectors which include an inducible selection gene and a sequence which 
codes for a heterologous polypeptide. In a particular example the selection gene comprises the tetA and 
tetR genes. The vector may also include a sequence which is capable of confenring stability on the 
vector, such as the cer sequence, a transcription terminator, such as gene 32 from bacteriophage T4, a 
ribosome binding site, and a multi-cloning site. Hosts transformed with the vector, processes for 
preparing these hots, and processes for preparing polypeptides using these hosts. 
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The present invention relates to a vector which Includes an inducible selection gene, a host containing the 
vector and processes for preparing the vector and host. 

The majority of bacterial cloning and expression vectors contain an antibiotic resistance marker as a simple 
means to maintain selection for the plasmid in the bacterial host of choice. This is commonly ampiclllln because 
5 of the fact that one of the original cloning vectors constmcted, pBR322, canries an ampiclllln resistance deter- 
minant (Bolivar et a[, 1 977, Gene 2: 95-1 13). One of the derivatives of this plasmid Is pAT153 (Twigg and Sher- 
ratt, 1 980, Nature 283 : 21 6-21 8). The amplcillin resistance marker Is constltutlvely controlled as are the majority 
of plasmid encoded resistances. 

Although a number of selection systems have been used in cloning vectors for the preparation of potypep- 
10 tides, there Is still a need for improved selection systems. 

According to the present Invention there is provided a vector which comprises an inducible selection gene, 
and a sequence which codes for a heterologous polypeptide. 

The term Vector** as used herein is used in Its broadest sense and includes within its meaning any replicon 
capable of transferring recombinant DNA material from one cell to another. The present Invention includes vec- 
15 tors suitable for Integration Into a host, and vectors, such as plasmids, which are useful in constructing vectors 
for transfer of recombinant DNA to hosts. 

The inducible *'selectlon gene** (or selection marker) will, in general, include a gene which facilitates selec- 
tion and is Inducible. The inducible selection gene may comprise a first gene which codes for a substance which 
facilitates selection, and a second gene which controls expression of that substance so that expression only 
20 occurs under defined conditions. Thus, for example the first gene may comprise a gene which codes for a sub- 
stance which confers resistance to an antibiotic and the second gene may comprise a gene which codes for a 
repressor. In the presence of the antibiotic the system is induced and expression of the first gene takes place 
to confer antibiotic resistance, thus permitting recombinant vectors which carry the selection gene to be selec- 
ted. In the absence of the antibiotic, repression occurs so that expression of the first gene does not take place 
25 and the substance which confers antibiotic resistance in the vector Is not produced. 

A particular example of a suitable inducible selection gene Is one which Includes the tetA and tetR genes. 
The tetA gene Is the gene which codes for a substance which confers resistance to tetracycline; whilst the tetR 
gene codes for a repressor protein which Is able to prevent expression of the tetA gene. The system Is Induced 
In the presence of tetracycline, so that In its presence the tetA gene Is expressed thus conferring tetracycline 
30 resistance on the vectors which carry the selection gene. In the absence of tetracyllne, the tetA gene is rep- 
ressed by the tetR gene so that expression of tetA does not take place and the product of this gene Is not gen- 
erated. 

Thus in particular embodiment of the present Invention there Is provided a vector which comprises an induc- 
ible selection gene which comprises the tetA and tetR genes, and a sequence which codes for a heterologous 
35 polypeptide. 

The vector may, in addition to the sequences mentioned above include other DNA sequences suitable for 
particular applications, such as appropriate control sequences. For example the vector may Include a promoter, 
ribosome binding site and transcription terminator sequence. The vector will, in general, include an origin of 
replication, for example that derived from plasmid pAT153. 

40 A particular example of a suitable promoter is the tryptophan (trp) promoter. Other promoters may be used. 

For example, in a further embodiment of the present invention the vector Includes the T7A3 promoter (SEQ ID 
NO 42), In which case the vector may optionally also include an operator such as lacO (especially the shortened 
lacO sequence of SEQ ID NO 43). The T7A3 promoter sequence shown In SEQ ID NO 42 is shown up to the 
base before the beginning of mRNA, so that when used with the lacO sequence the lacO sequence of SEQ ID 

45 NO 43 extends from +1 (beginning of mRNA). 

A particular example of a transcription terminator is a derivative of the transcription terminator sequence 
found in bacteriophage T4 gene 32. 

A sequence which confers stability on the vector may also be present. An example of such a sequence is 
the cer sequence (see, for example. Cell, 36, 1097-1103, 1984). 

50 The vector may include a multicloning sequence to facilitate the introduction of ribosome binding sequ- 

ences, and genes for heterologous polypeptides etc. 

The heterologous polypeptide may comprise a polypeptide which possesses pharmacological properties, 
and Is hence of use in medicine. A particular example of such a polypeptide is ricin A which may be used In 
the preparation of Immunotoxins. A further example Is a polypeptide known as G-CSF or an analogue thereof. 

55 Granulocyte colony stimulating factor (G-CSF) has been described in the literature by Wallet K. et al Proc. 

Natl. Acad. Sci. USA Vol 82, pp 1526-1530 and has also been described in European Patent Publication No 
169.566 and PCT Patent Publication No WO 87/01132. G-CSF has been shown to stimulate granulocyte pro- 
duction in vivo and to function with minimal side effects. As a result human G-CSF Is seen as having potential 
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utility in the management of neutropaenia associated with chemotherapy, radiation therapy, radiation accident 
or autologous bone marrow transplantation. Moreover G-CSF may have utility in the stimulation of bone marrow 
suppression associated with AIDS, in the treatment of myelodysplastic syndromes characterised by granulo- 
cyte functional abnormalities and as an adjunct to the treatment of severe infections. 

The term "human G-CSF" as used herein refers to those G-CSFs that have been found to exist in nature 
and includes the two polypeptides having the amino acid sequence set out in SEQ. ID No 41. These two 
polypeptides differ only in so far as a tripeptide Insert Val-Ser-Glu is present in one polypeptide between posi- 
tions 35 and 36» but absent in the other. The numbering used throughout the present specification is based on 
the naturally occunring polypeptide without the Val-Ser-Glu insert. 

Analogues of G-CSF include polypeptides which differ from that of naturally occurring G-CSFs in terms of 
the identity or location of one or more amino acid residues. For example, such analogues may contain sub- 
stitutions, or terminal or intemnediate additions or deletions of such residues. Such analogues would share the 
property of natural G-CSFs of being able to stimulate granulocyte production. 

In particular the present invention provides a replicable expression vehicle which comprises a vector as 
hereinbefore defined. 

In a particular embodiment of the present invention there is provided a replicable plasmldic expression ve- 
hicle which comprises an inducible selection gene comprising the tetA and tetR genes, and a DNA sequence 
which codes for a heterologous polypeptide. 

As indicated above, the expression vehicle may include a sequence capable of conferring stability on the 
expression vehicle, such as the cer sequence. 

In a further embodiment of the present invention there is provided a vector which comprises a replicable 
plasmldic expression vehicle comprising a promoter, the cer sequence, a transcription terminator as found in 
the terminus of gene 32 bacteriophage T4, an origin of replication and a DNA sequence which codes for a 
heterologous polypeptide. 

According to the present invention there is also provided a process for preparing a polypeptide, said pro- 
cess comprising culturing a host which comprises a vector of the present invention so that the polypeptide is 
expressed. 

The process may be carried out in the absence of the product used in selection. For example, in the case 
where the selection system comprises the tetA and tetR genes, the process may be earned out in the absence 
of tetracycline. - 

The above-mentioned process may be effected by the use of any appropriate host cell, such as bacterial, 
yeast, or mammmalian cells. A particular example of a suitable host comprises bacterial cells, for example E. 
coll . 

It will be appreciated that where the desired metabolite is not passed out of the host at a useful rate, the 
host may be cultured and harvested as the Intact cell and the desired polypeptide recovered by subsequently 
extracting the cells, for example after separation from the medium containing nutrients necessary for growth 
of the host cell. Where the metabolite is passed out of the host cell into the surrounding culture solution, then 
the polypeptide may be recovered by extraction in the normal way. 

According to the present invention there is also provided a host capable of expressing a heterologous 
polypeptide, which host comprises a vector (such as replicable plasmldic expression vehicle) as herein defined. 

In a particular embodiment of the present invention there is provided a host transformed with a replicable 
plasmidic expression vehicle which comprises an inducible selection gene comprising the tetA and tetR genes, 
and a DNA sequence which codes for a heterologous polypeptide. 

According to a further feature of the present invention there is also provided a process for the preparation 
of a host as hereinbefore defined, said process comprising transforming a host by the insertion therein of a 
vector (such as a replicable plasmidic expression vehicle) as hereinbefore defined. 

Suitable methods for the introduction of foreign genetic material into a host are known from the literature. 
Such methods include formation of a replicable expression vehicle comprising a vector and the foreign genetic 
material, and introduction of the vehicle into the host. Introduction of the vehicle into the host may be facilitated 
by subjecting the host to an appropriate treatment, for example in the case of E. coli , by treatment with calcium 
chloride solution. 

The present invention also provides a process for the preparation of a vector as herein defined comprising 
inserting a sequence which codes for the desired polypeptide into a vector (as herein defined) at an appropriate 
insertion site so that a vector (conveniently in the form of a replicable plasmidic expression vehicle) is obtained 
which is capable of directing synthesis of the polypeptide. 

According to a further aspect of the present invention there is provided a vector which comprises an induc- 
ible selection gene. The selection gene may be as defined hereinbefore, for example it may comprise the tetA 
and tetR genes. 
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The vector Of the present invention utilises an Inducible selection gene. This has been found to be particu- 
lariy advantageous since the product of this gene (tetA in the preferred embodiment) is only expressed during 
the construction and testing phases of the genetic manipulation. If the subsequent plasmid carrying the cloned 
gene is stably maintained in its bacterial host, the need for selection ceases. Cultures grown to express the 
cloned gene product will therefore not require addition of the selection drug and will consequently not express 
the product of the selection gene. Such a product is unavoidable in most vectors because they carry constitu- 
tively expressed selection genes. Such unwanted products are disadvantageous because they divert metabolic 
energy away from the cloned gene product and may produce undesirable contaminants. In particular the vectors 
of the present invention avoids the use of penicillins as selection markers. This is particularly advantageous 
because of the prevalence of allergic reactions to penicillin or its breakdown products in human populations. 
Presence of a p-lactamase encoded by the plasmid would prevent the simple detection of any such contami- 
nating p-lactams as active antibiotics. 

The use of the teWtetR genes as a selection system has been found to particulariy advantageous as, in 
general, the vectors which contain this selection system are unexpectedly stable. This stability helps to maintain 
15 expression levels and to improved accumulation of polypeptides, such as ridn A. 

The stablity of the vectors of the present invention is exemplified by plCI0042 which carries the tetA/tetR 
selection gene. This plasmid was unexpectedly found to have gained stability over its parent pAT153, even 
without the presence of the cer sequence. This Is an unexpected but very welcome advantage of the construc- 
tion of this plasmid. 

The invention will now be further described, by way of example only, with reference to the following Exam- 
ples; and accompanying drawings in which: 

Figure 1 illustrates transcription terminator sequences; 
Figure 2 illustrates the preparation of pTB344; 
Figure 3 illustrates the preparation of plCI0042; 
25 Figure 4 is a plasmid map of plCI0042; 

Figure 5 describes fragments used in the preparation of plasmids; 
Figure 6 is a plasmid map of picn 079; 
Figure 7 is a plasmid map of pLB015; 
Figure 8 Is a plasmid map of pCGG1 ; 
30 Figure 9 describes the sequence of [Ser^wjQ.QSF; 

Figure 10 describes the sequence of h-GCSF; 
Figure 11 is a plasmid map of pCG54; 
Figure 12 illustrates the construction of plCI0020; 
Figure 13 illustrates the construction of pICII 078; 
35 Figure 14 illustrates the construction of plCI1 102; 

Figure 15 illustrates a coomassie stained SDS gel of E.coli lysates in which track A is plCI1102; B is 
PICI0020, and C is molecular weight markers. 

Figure 16 illustrates a gel profile of pICI1 102 in which peak R represents ricin A; 

Figure 17 is a western blot of ricin A produced by plCI1102 and in which track 1 is rnol ecu I ar weight markers; 
2 and 3 are non-ricln producing clones; 4 is pICI1 1 02; and 5 is plCI0020 (control plasmid - no ricin A sequ- 
ence); 

Figure 18 is a partial sequence of plCI1102; and 
Figure ^19 illustrates the contruction of plCI1187. 

The sequences referred to are set out in the "Sequence Listing" following the Examples and sequences 
45 are specified in the conventional 5' to 3' sense. 

BUFFERS FOR RESTRICTION ENZYMES 

Stability: stable at -20°C. 
50 Buffer composition: 



40 
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Buffer components Final concentration in mmol/1 

(1:10 diluted set buffer) 



10 



45 





B 


M 


H 


Tris-HCl 


10 


10 


50 




5 


10 


10 


NaCl 


100 


50 


100 


Dithioerythritol (DTE) 




1 


1 


2-Mercaptoethanol 


1 






pH at 37«>C 


8.0 


7.5 


7. 



15 

The above buffers are available from Boehringer Mannheim. 
In the site-directed mutagenesis procedure - Example 7 
Buffer 1 1 00 mM Tris HCI pH 8.0 
20 lOOmMNaCI 

20 mM MgCl2 
Buffer 2 10 mM Tris HCI pH 8.0 

20mMNaC! 

1 mM EDTA 
25 Buffers 12 mM Tris HCI pH 7.7 

30 mM NaCI 

10 mM MgCl2 

8 mM 2-mercapto ethanol 
Buffer 4 60 mM Tris HCI pH 8.0 
30 90 mM NaCI 

6 mM MgCl2 

lOmMDTT 

Nucleotide mix 1 250 nM each of dATP. dGTP. dCTP=S (phosphorothioate derivative of dCTP). dTTP and 1 
mMATP 

35 Nucleotide mix 2 250 mM each of dATP. dGTP. dCTP. dTTP and 350 ^M ATP 

M9 minimal media 

40 Ammonium chloride Ig 

Disodium hydrogen orthophosphate 6g 

Potassium dihydrogen orthophosphate 3g 

Sodium chloride 0.5g 

In distilled water 1 1. 

Supplements/75ml 

50 300 ^l 50% glucose 

75^1 1MMgS04 

75 ^l 0.1MCaCl2 

75 ^l 4 mg/ml thiamine 

75 ^1 20% casein amino acids 

55 
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55 



Trace Element Solution (TES) 

TES has the following composition: 



5 




mg/10 ml 






deionised water 


10 


AlCl^.eH^O 


2.0 


COC12.6H20 


0.8 




KCr(S0^)2.12H2O 


0.2 




CuC1^.2H20 


0.2 


15 


H3BO3 


0.1 




KI 


2.0 




MnSO, .H«0 
A 2 


2.0 


20 


NiS0^.6H20 


0.09 




Na2Mo0^.2H20 


0.4 




4 2 


0.4 



25 Geneclean (TM) 

The kit contains 1) 6M sodium iodide 2) a concentrated solution of sodium chloride, Tris and EDTA for mak- 
ing a sodium chloride/ethanol/water wash; 3) Glassmilk (TM)- a 1 .5 ml vial containing 1 .25 ml of a suspension 
of silica matrix In water. 



This is a technique for DNA purification based on the method of Vogelstein and Gillespie published In Pro- 
ceedings of the National Academy of Sciences USA (1 979) Vol 76. p 61 5. 

Altematively any of the methods described in "Molecular Cloning - a laboratory manual" Second Edition 
Sambrook. Fritsch and Maniatis (Cold Spring Harbor Laboratory, 1989) can be used. 

35 Random Label Kit Product of Phanmacia No 27-9250 

The procedure is described in "Molecular Cloning - a Laboratory Manual" Second Edition, Sambrook 
Fntsch and Maniatis, pp 10.13-10.17 (Published by Cold Spring Harbor Laboratory 1989). 

40 Sequenase (TM) 

Chemically modified T7 DNA polymerase 

Based on the procedure of Tabor and Richardson published in "Proceedings of the National Academy of 
45 Sciences USA (1987) vol 84 pp 4767-4771 . 

T4 DNA ligase 

Described in "Molecular Cloning - a Laboratory Manual" Second Edition, Sambrook, Fritsch and Maniatis 
50 5.60-5.64 (Published by Cold Spring HariDor Laboratory 1989) and also by Weiss B. et al J. Biol Chem Vol 
243 p 4543 (1968). 

E.coli strains 



The E,coli strains HB101 and CGSC 6300 (also referred to herein as MSD522) are freely available. Thus 
for example they may be obtained from the E. co[i Genetic Stock Centre, Yale University. USA. Moreover E, 
coli HB101 may additionally be obtained from for example BRL supplied by GIBCO Limited Unit 4 Cowley Mill 
Trading Estate. Longbridge Way, Uxbridge, UB8 2YG, Middlesex, England or GIBCO Laboratories Life 
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Technologies Inc., 31 75 Staley Road, Grand Island, NY 14072. USA. The genotype of strain HB1 01 is described 
in the aforementioned "Molecular Cloning - A Laboratory Manual" as Sup E44 hsd S20 (re" mB")rec A 13 ara-14 
Fleu 6 thi-1 proA2 lac Y1 gal K2 rps L20 xyl'S mtn. The genotype of MSD 522 (CGSC 6300) is set out in 
Example 3. 

5 The following non-limiting Examples are given by way of illustration only. 

EXAMPLE 1 

Preparation of plasmids containing trp promoter/tetA/tetR genes. 

10 

(a) Preparation of plCI0042 

Many plasmid vectors are based on one of the original cloning vectors: pBR322 (Bolivar et al , 1977, Gene 
2: 95-1 13). The non-mobilizable pAT153 is.a derivative of this (Twigg and Sherratt, 1980. Nature 283 : 216-218). 
IS Both these plasmids contain the amplcfllin resistance determinant, TEM p-lactamase. 

Plasmid plCI0042 utilises a repressed tetracycline resistance detenminant, as found on the naturally-occur- 
ing plasmid RP4. This repressed system shuts off expression of the tetA gene in the absence of tetracycline 
whereas most drug resistant mechanisms have constitutive expression. 

The tet locus was first mapped on RP4 by Barth and Grinter (J.Mol. Biol.113 : 455-474, 1977). This was 
20 shown to consist of adjacent genes: tetA . the structural resistance gene and tetR, the repressor gene and this 
region has been sequenced (Klocl< etal, J. Bacterid : 161 :326-332, 1 985). These genes are located on adjacent 
Bglll-Smaland Smal-Smal fragments. The Bglll site is unique in RP4 but there are five Smal sites (Lanka. Lurz 
and Furste, Plasmid 10 : 303-307. 1983). 

25 (i) Cloning the tetA + tetR genes 

The plasmid RP4 is well documented (Datta et al. J. Bacterid 108 : 1244. 1971) and is freely available. Fur- 
thermore, the plasmid RP4 has been deposited with the National Collection of Type Cultures, 61 Colindale 
Avenue, London. NW9 5HT under accession numbers 50078 and 50437. RP4 obtained from N Datta (National 

30 Collection of Type Cultures) was used herein. E. cdi strains containing this plasmid were grown in selective 
broth cultures and plasmid DNA was isolated by a scale-up of the Holmes and Quigley method (Holmes and 
Quigley, Anal. Biochem 114: 193-197, 1981). It was deproteinized by freatment with 2.5M ammonium acetate 
and reprecipitated with isopropanol. This plasmid DNA was treated, according to the supplier's recommended 
conditions, with restriction enzyme Bglll and cut to completion. It was then partially cut by Xmal by using diluted 

35 enzyme and short incubation times. Xmal is an isoschizomer of Smal but which produces 4-nucleotide cohesive 
ends at its cut sites. 

The vector plasmid pUC8 (Yanisch-Penron. Vieira and Messing. Gene 33: 103-119, 1985) was similariy 
prepared and cut with BamHI and Xmal to completion. The RP4 fragments were cloned into this vector by liga- 
tion with T4 ligase at 1 2°C for 1 6 hours. This was used to transfomn E. cdi C600 made competent by the calcium 

40 Chloride nriethod (Maniatis etal. Cold Spring Harbor Laboratory, 1 982). Cultures were then plated onto medium 
which selected for tetracycline resistance. 

Exdi C600 is freely available from numerous sources including many culture collections such as the E.coli 
Genetic Stock Centre. Yale University, USA under accession number GCSC 3004. The genotype of E.coli C600 
is K12 thr-1 ieuB6 thi-1 lacYI tonA21 X' supE4 4. 

45 Several colonies with this resistance were checked for the expected phenotype (ampicillin and tetracycline 

resistance but not the kanamycin resistance indicative of RP4 itself). Colonies with the correct resistances were 
subjected to clone analysis by isolating plasmid DNA (Holmes and Quigley method). These preparations were 
cut with EcoRI and Hindlll and analysed by gel electrophoresis. This established the size of the cloned insert 
which was found to be the 2.45 kb predicted for the Bglll - Xmal - Xmal fragment from RP4. A done canying 

50 this fragment containing the tetA and tetR genes was designated pTB344. (Figure 2) 

(ii) Removal of the tet gene from pAT153 

It was necessary to remove the tet gene from the vector plasmid pAT153 before inserting the tetA + tetR 
55 cassette from RP4 to prevent gene duplication which can be a source of genetic instability. Also the tet gene 
may not be effectively suppressed by the non-cognate tetR. The removal was done by isolating plasmid pATI 53 
DNA and cutting it with EcoRI and Aval . Between these sites, synthetic oligonucleotides with the following sequ- 
ence (SEQ IN NO. 40):- 

7 
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5 ' AATTCGCATGCGGATCCATCGATC3 ' 

3 ' GCGTACGCCTAGGTAGCTAGAGCC5 ' 

5 were cloned. These fit the EcoR I and Ava l cohesive ends and contain SphI, BamHI and Clal sites in addition. 
After transformation and selection, colonies were tested for the loss of the tetracycline resistance determinant. 
Plasmid DNA from one clone was sequenced to canfirm that the predicted sequence was correct. This ptasmid 
was designated plCI0019. (Figure 3) 

10 (iii) Insertion of the tetA + tetR genes 

The tetA and tetR genes were Isolated from pTB344 on an EcoRI to PstI fragment. The pUC8 vector was 
destroyed by cutting with Sspl because It carries the same selection detenminant (ampicillin resistance) as 
plC!0019. Plasmid plCI0019 DNA was cut with EcoRI and PstI and then ligated with the 2.45kb fragment car- 
ts rying the tet genes. This was used to transfonm E.coli C600. the culture being plated out under selection for 
tetracycline resistant colonies. The insertion of the tet genes was designed to replace most of the bla gene in 
pCH19 which should thus lose its ampicillin resistance detenminant. Loss of ampicillin resistance from the trans- 
fonmants was confinmed. A few clones were then used to isolate ptasmid DNA which was subjected to restriction 
analysis. This confirmed that the constructed plasmid had the intended structure. It was designated pTB351. 
20 (Figure 3) 

(iv) Insertion of the cer sequence 

The naturally-occuring plasmid ColEI is very stably maintained in E.coli , whereas its derivatives pBR322 
25 and pAT1 53 are not. Summers and Sherratt (Cell, 36 : 1 097-1 1 03, 1 984) demonstrated that this was due to the 
derivatives not containing a short (283 bp) sequence called cer which is present in the parent plasmid. This 
sequence contains a site-specific plasmid multimer-resolution system which prevents the accumulation of plas- 
mid multimers formed by homologous recombination. Such multimers have a deleterious effect on the process 
of partition which normally ensures stable inheritance of daughter plasmids during bacterial cell division. 
30 The cer sequence (Summers,D et al MGG, 201 , p334-338. 1 985) was isolated from plasmid pKS492 (pro- 

vided by D. Sherratt) as a 289 bp fragment by cutting with BamHI and Taql . The plasmid pTB351 was isolated 
as DNA from a dam strain of E. coM to prevent its Clal site being blocked by the dam^ methylation system. This 
DNA was cut with BamHI and Clal (both these sites having been introduced on the synthetic oligonucleotide 
for this cloning). The cer fragment was ligated with the cut vector and then used to transform E. coli C600, selec- 
35 tion being made for tetracycline resistance. Transformant colonies were subjected to clone analysis by Aval 
restriction and gel electrophoresis. The presence of an extra DNA band of about 300 bp indicated the acquisition 
of the cer fragment. Further restriction analyses were used to confirm that resultant plasmids had the correct 
structure. One of these was designated plCI0042 (Figure 3 and 4). 

40 (v) Tests on pi CI 0042 

The plasmid has been completely sequenced using the DuPont Genesis 2000 machine. (Figure 3 and 4). 

The inducibility of the tetracycline resistance was checked. First the minimal inhibitory concentration (MIC) 
of tetracycline for (plCI0042)C600 was measured. This was done by inducing a culture of this strain with 0.5 
45 ^g/ml of tetracycline. Aftergrowth, this culture was serially diluted and plated onto broth media containing levels 
of tetracycline from 0 to 500 ng/ml to give about 100 colonies per plate. The MIC was found to be about 200 
^ig/ml. A culture of (plCI0042)C600 was then grown to eariy log phase in Luria broth in the absence of tetracyc- 
line. This was used to inoculate parallel cultures in broth, with and without tetracycline induction. These were 
grown with aeration at 37°C for 90 minutes to allow induced expression. Serial dilutions of these cultures were 
50 then plated onto rich medium with and without 100 (ig/ml of tetracycline. The resulting colonies demonstrated 
that the viability of the uninduced culture was 1600-fold lower than the induced culture on the tetracycline 
medium. This confirms that the tetA + tetR induction system in plCI0042 is working satisfactorily. 

The maintenance stability of pTB351 and plCI0042 in E. coli C600 were checked. This was in order to com- 
pare them with their parent plasmid pATI 53 and note the effect of the cer sequence in plCI0042. Cultures were 
55 grown without selection and samples checked for the presence of the plasmid after 50, 100 and 150 generations 
of growth. We found no plasmid loss by either strain throughout this period. Thus pTB351 , even without a cer 
sequence, appears to have gained stability over its parent pATI 53. This may be a consequence of the deletion 
of its tet gene. Naturally-occuring plasmids always have tetracycline resistance under inducible control so the 
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constitutive tet gene in pAT153 may be counterselective in the absence of tetracycline. This could be due to 
the fact that the tetracycline resistance mechanism acts as a cytoplasmic membrane export pump. When not 
required it may harm the cell by damaging the membrane structure, exporting wanted metabolites or wasting 
metabolic energy. The presence of the cer sequence in plCI0042 should contribute to plasmid maintenance 
5 stability even under the counterselective conditions of using it to express a recombinant gene at high level. 

(b) Preparation of Plasmid pCH101 

The plasmid pCH1 01 corresponds to pICI 0020 (see Example 5c) except that the EcoRI-Sall fragment (see 
10 Figure 5a) is replaced by a fragment consisting of the SEQ ID No 34 (see Figure 5b also) and the interferon 
a2 gene sequence as described by Edge M.D. et a[. Nucleic Acids Research 1983. Volll, p641 9-6435. In this 
regard the 3'-terminal ATG codon of SEQ ID No 34 immediately precedes the TGT codon which codes for cys- 
teine (amino acid 1) in the interferon 02 sequence of the above-mentioned Edge M.D. et al Nucleic Acids 
Research reference. The 5' nucleotide sequence GATCCATG and the complementary 3' nucleotide sequence 
15 GTAC are thus omitted from the nucleotide sequence of the aforementioned reference. 

(c) Insertion of an Expression Cassette into plCI0042 

An expression cassette consisting of the tr^ promoter, a ribosome binding site and the interferon 02 gene 
20 was isolated from plasmid pCHIOI (see b above) on an EcoRI to SphI restriction fragment. This was ligated 
into the production vector (plCI0042) (see above) similarly cut with EcoRI and Sphl . This DNA was used to 
transfonm a competent culture of E. coli C600 and tetracycline resistant colonies were isolated. A few of these 
were tested by DNA clone analysis for the acquisition of the SstI restriction site carried on the expression cas- 
sette. Clones positive in this respect were further tested by restriction mapping to check that the expected con- 
25 struct was correct They were also checked for the confenred capacity to produce interferon 02 protein as 
analysed on a polyacrylamide-SDS gel stained with Coomassie blue. One such confinmed clone was desig- 
nated pLB005. 

(d) Insertion of T4 transcription terminator into pTB244 

The T4 transcription terminator sequence in the fonn of the Sail to Hindlll fragment (67 bases pairs long) 
(see SEQ ID No. 33 and Figure 1b) was inserted into the multicloning site of an intenmediate vector pTB244 
(described in European Patent Publication No. 237.269) between its Sail and Hindlll sites. Clone analysis was 
used to confirm the structure of this construct (pTB244-T4 ter). From this vector, an Sst[ to Sphl fragment con- 
35 taining most of the multicloning site and the T4 terminator was then isolated. This was inserted into pLB005 
similarly cut with SstI and Sphl thereby substituting the interferon az gene but leaving a cassette consisting of 
the tTj^ promoter, multicloning site and T4 terminator. This construct was confirmed by clone analysis and the 
plasmid designated pLB013. 

40 (e) Substitution of the multicloning site 

The multicloning site in pLB013 is not ideal for this vector in several respects: the Sail BamHI and Smal 
sites are not unique but exist elsewhere on the plasmid. This fragment was therefore excised by cutting with 
SstI and Xbal (both unique) and synthetic oligonucleotides with the sequence of SEQ ID No. 35:- 

45 

5 ' AGCTCCATATGGTACCAGATCTCTCGAGAGTACTT 

GGTATACCATGGTCTAGAGAGCTCTCATGAAGATC 5 ' 

50 were inserted in its place. Clones were analysed for acquisition of the new restriction sites and then confirmed 
by sequencing. One such plasmid was designated pLB014. The new cloning sites inserted in this way are: Ndel , 
Kpnl, Bglll , Xhol and Seal with the previous Xbal and Sail following them. 

(f) Further modification 

55 

It was discovered that the adjacent SstI and Ndel sites in pLB014 could not be cut by both these restriction 
enzymes either simultaneously or sequentially presumably because of their close proximity. An additional sequ- 
ence was therefore inserted between them. This was done by cutting pLB014 with SstI and Kpnl and then insert- 
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ing the synthetic oligonucleotide of SEQ ID No. 36> 

5' AGCTCAGCTGCAGCATATGGTAC 
5 GTCGACGTCGTATAC 5' 

Clones were analysed for acquisition of an extra Pvull or PstI site and then confirmed by sequencing. One 
such plasmid was designated pLB015 (= plCI0080) (see Figure 7). This plasmid, unlike pLB014, is efficiently 
cut by SstI and Ndel . This is to provide a place to insert a variety of ribosome binding site sequences correctly 
10 positioned with respect to the upstream trp^ promoter and with Ndel designed to provide the ATG start codon 
of the gene to be expressed. 

EXAMPLE 2 

15 Preparation of [Arg^\Ser^7-27,6o.65]human G-CSF using vector including trp promoter 

a) Plasmid plCI1239 (described in Example 7) was digested with EcoRI and Sal! in buffer H as described 
previously. The small EcoRI-Sall fragment containing the trp promoter, ribosome binding site and gene for 
[Argil, Ser^7,27,6o,65]hu g-CSF was isolated from a 0.7% agarose gel by use of Geneclean(TM). A vector frag- 
ment was prepared from pICI 0080 (see Example 1f) by digestion with EcoRI and Xhol in buffer H and the large 

20 EcoRI-Xhol fragment isolated from a 0.7% agarose gel by use of Geneclean(TM). The small EcoRI-Sall frag- 
ment was ligated into the EcoRI-Xhol vector fragment, using a 2:1 molar excess of insert to vector as described 
previously and the ligation mix used to transform E. coli strain MSD 522. Transformants were selected for growth 
on L-agar plates containing tetracycline (15|ig/ml). Three colonies were selected and grown up in M9 minimal 
media (75ml) containing supplements and tetracycline (15^g/m1) at 37°C for 20 hours on a reciprocating 

25 shaker. Protein accumulation was measured by scanning Coomassie blue stained SDS-PAGE gels of whole 
cell lysate. All three clones expressed [Arg^ ^Ser^ ^-27,60,65] |-,y G-CSF. Plasmid DNA from one of the colonies was 
designated plCI1327 and the sequence of the promoter and gene confirmed by standard dideoxy sequencing 
procedures as described previously. 

30 b) Fermentation 

pICI 1327 was transfonmed into E. coli strain MSD 522 and the resultant recombinants purified and main- 
tained on glycerol stocks at -BO^'C. 

An aliquot of the culture was removed from stock and streaked onto agar plates of tetracycline to separate 
35 single colonies after overnight growth at 37^C. A single desired colony was removed and resuspended in 10 
ml tetracycline broth and 100)il immediately inoculated into each of 3 250 ml Erie nmeyer flasks containing 75 
ml tetracycline broth. After growth for 16h at 37*^0 on a reciprocating shaker the contents of the flasks were 
pooled and used to inoculate a fermenter containing 20L growth medium. 

40 



45 



50 



55 
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Composition of Growth Medium 



10 



15 



20 



25 



30 



35 



40 



45 



50 



NajHPO^ 
NaCl 

Casein hydrolysate (Oxoid L41) 
(NH^)2S0^ 

Yeast Extract (Difco) 

Glycerol 

L-Leucine 

MgSO^. 7H2O 

CaCl^. 2H2O 

Thiamine 

FeSO^/Citric Acid 

Trace element solution (TES) 

Tetracycline 



Made up of distilled water 

3.0 

6.0 

0.5 

2.0 
10.00 
10,00 
35.00 

0.625 

0.5 

0.03 

0.008 

0. OA/0. 02 
-1 



0.5ml 1 
lOmg 1 



-1 



Fennentations were then carried out at a temperature of 37**C, and at a pH, controlled by automatic addition 
of 6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air-satu- 
ration and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fer- 
menter. initially 20Lymin, corresponding to 1 volume per volume per minute (WM) was increased to SOUmin 
(2.5 WM) when the fermenter stirrer speed approached 80-90% of its maximum. Since the oxygen transfer 
rate (OTR) of the femnenters was unable to meet the oxygen uptake rate (OUR) of the bacteria at a cell density 
greater than that corresponding to an OD550 of 50 under the conditions described, dOT in the fermenter at cell 
densities greater than this was maintained at 50% air-saturation by restricting bacteria oxygen uptake rate. This 
was achieved by formulating the medium to become carbon-limited at OD550 of 50 and then supplying a'feed 
of the limiting carbon source, together with ammonium sulphate and yeast extract, at a rate which restricted 
bacterial growth rate. 

Fermentations were performed for 1 8h and during that time samples were taken for measurement of optical 
density (OD550). cell dry weight and accumulation of [Argi\Seri 7.27.60,65] human G-CSF within the cells. 
[Argil, Seri7.27,6o,65]human g-CSF accumulation was measured by scanning Coomassie blue stained SDS-PA- 
GE gels of whole cell lysates of the sampled bacteria as is well known in the art. 

When OD550 reached 35 (8.5h), casein hydrolysate solution (100g/1 Oxzoid L41) was pumped into the fer- 
menters at a rate of 0.75g/l/h. 

When OD550 reached approximately 50, the supply of carbon-source in the fermentation batch became 
exhausted leading to a rapid rise in dOT from 50% air saturation. At this point, a feed containing glycerol 
(470g/1). yeast extract (118g/1) and ammonium sulphate (118g/1) was pumped into the fermenters at a rate 
which returned and then maintained the dOT at 50% air saturation with the fermenter stirrer at ca 70-80% of 
its maximum. Casein hydrolysate feeding was maintained at 0.75g/l/h throughout. After approximately 1 8 hours, 
when microscopic examination of the culture showed the presence of large inclusion bodies within a majority 
of the cells, bacteria were harvested on a Sorval RC3B centrifuge (7000g, 30 min., 4**C) and stored frozen at 
minus 80°C. 



55 



EXAMPLE 3 

Preparation of [Argi\Seri 7^7.6o.65] human G-CSF using vector including T7A3 promoter 

a) An EcoRI-Sall fragment, containing a T7A3 promoter, a trp leader ribosome binding site sequence and 
a gene for [Seri7.27]hu G-CSF was sub-cloned into M13 mp18 as described in part d) of Example 5. The sequ- 
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ence of the EcoRI-Sall fragment is set out in SEQ ID No 32 and Figure 9. SEQ ID No 32 consists of the EcoRI 
restriction site (nucleotides 1-6), the A3 promoter sequence of bacteriophage T7 (nucleotide 7-52), the trp 
leader ribosome binding site sequence (nucleotides 53-78)and translation initiation codon (nucleotides 79-81). 
Figure 9 sets out the nucleotide sequence of [Seri^^Tjhuman G-CSF temninating in the Sail restriction site. It 
5 will be appreciated that the 3' tennina! ATG codon of SEQ ID No 32 immediately precedes the ACT codon which 
codes for threonine (amino acid 1) in Figure 9. The 5' nucleotide sequence AATTCAGT is thus absent from 
the EcoRI-Sall fragment The EcoRI-Sall fragment may also be prepared by excision from pICI 1295 (see 
Exannple 8). Site-directed mutagenesis was performed on single-stranded DNA as described in the protocol 
described in Example 7 using oligonucleotide SEQ ID No 28 to convert the codon for Gin at position 1 1 to Arg. 

10 Double-stranded RF DNA was prepared from a plaque containing the Glnii-»Argi^ change as described in 
Example 6, except that at step B3 incubation was for 3 hours instead of 5 hours, and digested with EcoRI (as 
described previously) and SnaBI (10 units, IxM buffer, BCL, 30^tl. 2 hours, 37*>C). The resulting 144 bp Eco- 
RI-SnaBI fragment containing the T7A3 promoter, trp leader ribosome binding site sequence and gene fragment 
with Argii codon was isolated and ligated to an EcoRI-SnaBI cut vector from pIC1 1 327 (which contains codons 

15 for Ser^o and Ser^ and is described in Example 2). The ligation mix was used to transfonm E.coli strain MSD522 
and transfomiants selected for growth on L-agar plates containing tetracycline (15ng/mg). Plasmid DNA from 
a colony containing the expected T7A3 promoter and [Argii Ser^ 7^7.60,65] hu G-CSF gene sequence were iden- 
tified by sequencing DNA from the isolated plasmid and designated pICI 1386. 

The fermentation was effected according to two alternative processes (b) and (c) below. Process (b) was 

20 effected at 37°C and after 16 hours fermentation as described, microbial biomass was 35 g/l and 
[Argil, Ser^7,27.6o,65]human g-CSF was estimated to be accumulated to 7g/l femnentation broth. Process (c) was 
effected at 30*'C and the fermentation was accordingly slower because of the lower fermentation temperature. 
With regard to process(c), after 35 hours, the microbial biomass was 55 g/l and the [Arg ".Serl^^^.so^esj human 
G-CSF yield was estimated to be accumulated to 15 g/l femrientation broth. 

25 b) E.Coli strain CGSC 6300 (genotype F", A." Iac+) obtained from the E.coli Genetic Stock Centre was trans- 
formed with plasmid pICI 1386. The resultant strain CGSC 6300 (pICI 1386) was purified and maintained in 
glycerol stocks at -80^C. An aliquot of the culture was removed from stock and streaked onto agar plates of 
L-tetracycline to separate single colonies after overnight growth (16h) at 37°C. 

A single colony of CGSC 6300 (pICI 1386) was removed and resuspended in 10ml L-tetracycline broth and 
30 lOOjil immediately inoculated into each of twenty 250ml Erienmeyer flasks containing 75ml of L-tetracycline 
broth. After growth for 16h at 37°C on a reciprocating shaker the contents of the flasks were pooled, and used 
to inoculate a femienter containing 20 litres of modified LCM50 growth medium. The composition of the growth 
medium is in Table 1. 

35 



40 



45 



50 
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TABLE 1: Composition of growth medium 
Modified LCM50 Growth Medium (A) 

made up with distilled water 
g/1 



10 l^H^PO^ 3.0 

Na^HPO^ 6.0 

NaCl 0.5 

Casein Hydrolysate (Oxoid LAI) 2.0 

(NH^)2S0^ 10.0 

Yeast extract (Difco) 20.0 

Glycerol 35.0 

MgSO^.yH^O 0.5 

CaCl2-2H20 0.03 

Thiamine 0.008 

FeSO, /Citric acid 0.04/0.02 

1 

Trace element solution(TES) (0.5ml 1~ ) 

Tetracycline (10 mg I""'") 



30 The fermentation was then earned out at a temperature of 37**C and at a pH. controlled by automatic addi- 

tion of 6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air 
saturation and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the 
fenmenter was initially 20 Umin corresponding to 1.0 volume volume per minute (WM) and was increased to 
45 L/min manually when the fenmenter stirrer speed reached its maximum (1000 rpm). The fermentation was 

35 performed for 16h and during that time samples were taken for measurement of optical density of the culture 
(OD550 biomass concentration, total microbial protein concentration and accumulation of [Arg''^Ser17'27,6o,65]hu- 
man G-CSF within the bacterial cells. Accummulation was measured by scanning Coomassle blue stained 
SDS-PAGE gels of whole cell lysates of the sampled bacteria as is well known in the art. Total microbial protein 
was estimated by the method of Lowry. A solution of yeast extract (225 g/L) was pumped into the fermenter 

40 4.5h post inoculation at 1.7 g/L/h. When the supply of carbon source (glycerol) in the growth medium became 
exhausted dOT increased rapidly from 50% air saturation. At this point a feed containing glycerol (714 g/l) and 
ammonium sulphate (143 g/L) was pumped. Since the bacterial oxygen sulphate rate (OUR) approached the 
maximum oxygen transfer rate of the fermenter (OTR) just prior to the carbon source in the batch growth medium 
becoming exhausted, the feed was pumped into the fenmenter at a rate which restricted the bacterial OUR to 

45 approximately 80-90% of the fenmenters maximum OTR. The feed rate was adjusted manually to return and 
then maintain dOT at 50% air saturation under the conditions described. 

c) Thefennentation process described in (b) was repeated but at a temperature of 30°C for 35 hours. Except 
for the fermentation temperature of 30**C the medium and fenmentation conditions were identical to those des- 
cribed in (b). 

50 

EXAMPLE 4 

Preparation of plasmid pAG88 

55 a) Preparation of a synthetic gene for human G-CSF 

A DNA sequence (Figure 10) encoding the amino-acid sequence of the polypeptide of Figure 10 (human 
G-CSF) was designed according to the following considerations: 
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1) Single - stranded cohesive teimlni to allow ligation at suitable sites in a plasmid. 

2) A series of restriction endonuclease sequences throughout the gene to facilitate subsequent genetic 
manipulation. 

3) Translation tenmination codon. 

5 4) Codons at the 5'-end of the coding region were normally chosen to be A/T rich. Other codons were nor- 

mally chosen as those preferred for expression in E.coli. 

The gene was assembled from the 18 oligonucleotides designated SEQ ID No.1 - SEQ ID No.1 8 and shown 
hereinafter. 

10 Preparation of Oligonucleotides 

The oligonucleotide sequences shown hereinafter were prepared on an Applied Biosystems 380A DNA 
synthesiser from 5'-dimethoxytrityl base-protected nucleoside-2-cyanoethyl-N,N-dlisopropylphosphoramidites 
and protected nucleosides linked to controlled-pore glass supports on a 0.2 micro mo! scale, according to pro- 
fs tocols supplied by Applied Biosystems Inc. 

Altematively, the oligonucleotide sequences may be prepared by manual methods as described by Atkin- 
son and Smith in 'Oligonucleotide Synthesis, a Practical Approach' {M. T. Gait. Editor, IRL Press, Oxford, 
Washington DC. pages 35-81). 

In detail, the preparation of the oligonucleotide sequences by use of the Applied Biosystems 380A DNA 
20 synthesiser was effected as follows:- 

Each oligonucleotide, after cleavage from the solid support and removal of all protecting groups, was dis- 
solved in water (1ml). A solution of 3M sodium acetate (pH5.6; 40^^) and ethanol (1ml) was added to the 
oligonucleotide solutions (400|il) and the mixtures stored at -70°C for 20 hours. The resulting precipitates were 
collected by centrifugation (13.000rpm for 10 minutes) and the pellets washed with ethanol: water (7:3) (200^l) 
25 then dried briefly in vacuo and dissolved in water (15|il) and 10|il of a fonmamide/dye mix. (lOmM NaOH, 0.5mM 
EDTA, 0.01% Bromophenol Blue, 0.01% xylene cyanol. 80% fomnamide. 

The oligonucleotides were purified on a 10% polyacrylamide gel in 50mM Tris-borate (pH8.3) containing 
8.3M urea. Oligonucleotides of correct length were identified by UV shadowing (Narang et al. 1979 in Methods 
in Enzymology Vol 68, 90-98) - normally the most prominent band - excised from the gel and electroeluted in 
30 5mM tris-borate (pH 8.3) at 300mV for 3-4 hours. The aqueous solutions were concentrated to about 200fil by 
treatment with n-butanol (mix. spin and removal of the upper organic layer). The purified oligonucleotides were 
precipitated at -70°C for 20 hours from a 0.3M sodium acetate solution by addition of ethanol (2.5 volumes). 

Assembly of gene 

35 

Oligonucleotides SEQ ID No2 - SEQ ID No 17 (400pM of each) [as defined hereinafter] were phosphory- 
lated with T4 polynucleotide kinase (3.6 units) for 2 hours at 37**C in 25^l of a solution containing ATP (800pM 
containing 25pM gamma- ATP), lOOjiM spermidine, 20mM MgCls. 50mM Tris-HCI (pH9.0) and 0.1 mM 
EDTA. The solutions were heated at 100*^0 for 5 minutes to tenminate the reactions, then mixed in pairs as 

40 Shown in Table 1 to give duplexes A to I (Oligonucleotides SEQ ID No 1 and SEQ ID No 18 (400mM in 25\x\) 
were used unphosphorylated). 0.3M Sodium acetate (pH5.6, 200^1) and ethanol (850^l) were added and the 
duplexes precipitated at -20°C for 20 hours. The resulting precipitates were collected by centrifugation and 
washed with ethanohwater (7:3) then dissolved in water (50^!). The pairs of oligonucleotides were annealed 
together by first heating the solutions to 100^0 for 2 minutes in a boiling water bath. The bath was then allowed 

45 to cool slowly to 40°C (about 4 hours). Solutions containing 3 pairs of duplexes were combined as shown (see 
Table 1). to give groups I to III lyophilised and dissolved in 30^il of a solution containing T4 DNA ligase (1 unit; 
BRL). 50mM Tris (pH7.6), lOmM magnesium chloride, 5% (w/v) PEG 8000, 1mm ATP. 1mm DTT. (BRL, Focus 
Vol 8 no 1 Winter 1986) and the DNA ligated at 30°C for 5 minutes followed by 20 hours at 16°C. 3M Sodium 
acetate (20^l) and water (150^1) was added and the product precipitated by addition of ethanol (750^il) and 

50 cooling to -20^0 for 20 hours. The precipitate was collected by centrifugation and washed with ethanol (1ml) 
^ then dissolved in water (15fil) and fomnamide/dye mix (lOiil) and purified on a 10% polyacrylamide gel in 50mM 
Tris-borate (pH8.3), 1mM EDTA and 8.3M urea. Bands for strands of appropriate lengths (173-188 bases) were 
identified by autoradiography and isolated together by electroelution from a single gel slice as described above 
for individual oligonucleotide sequences. The DNA strands were annealed by first heating an aqueous solution 

55 (50^1) at 100**C for 2 minutes, then allowing it to cool to 40*'C over 4 hours. 

Groups I, II and lit were ligated together essentially as described for the group preparation to give as the 
product, the gene sequence shown in Figure 10. After precipitation, the gene was phosphorylated with T4 
polynucleotide kinase as described previously for individual oligonucleotides, then dissolved in water (20^l). 
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TABLE 1 



DUPLEX 



OLIGONUCLEOTIDE 



NUMBER OF BASES IN 
TOP STRAND BOTTOM STRAND 



10 



15 



20 



25 



A 


SEC 


ID No 


1 + 


SEQ ID No 2 


62 


64 


B 


SEQ 


ID No 


3 + 


SEQ ID No 4 


60 


60 


C 


SEQ 


ID No 


5 + 


SEQ ID No 6 


48 


51 


D 


SEQ 


ID No 


7 + 


SEQ ID No 8 


63 


60 


E 


SEQ 


ID No 


9 + 


SEQ ID No 10 


63 


63 


F 


SEQ 


ID No 


11 


+ SEQ ID No 12 


60 


63 


G 


SEQ 


ID No 


13 


+ SEQ ID No lA 


63 


60 


H 


SEQ 


ID No 


15 


4. SEQ ID No 16 


60 


60 


I 


SEQ 


ID No 


17 


+ SEQ ID No 18 


55 


53 


I . 


A + 


B + C 






170 


175 


II 


D 


E + F 






186 


186 


III 


G + 


H + I 






178 


173 



b) Cloning of the synthetic gene for human G-CSF 

30 The synthetic gene described above, was cloned into the plasmid vector, pSTP1 (Windass et al. Nucleic 

Acids Research, 1983. Vol 10, p6639). 

For vector preparation. 10(ig of STP1 was dissolved in water (37.5(il) and 10 x B restriction buffer (4.5^l) 
(BCL). the restriction endonuclease Sal! (3^1) (BCL, 8 units/|il) was added and the mixture incubated at 37*'C 
for 1 hour until linearised plasmid was predominant over supercoiled and nicked circular forms. The DNA was 

35 precipitated with ethanol at 4**C for 30 minutes, washed with ethanohwater (7:3) then dissolved in water (39.5^l), 
1 0X H buffer (4.5^il) (BCL). The restriction endonuclease EcoRI (1 |il) (BCL. 90 units/fi!) was added and the mixt- 
ure incubated at 37°C for 1 hour until the large EcoRI-Sall fragment was predominant The DNA was precipi- 
tated at -20°C for 20 hours, washed with ethanohwater (7:3) then dissolved in water (20m.I) 

The large EcoRI - Sal! fragment was purified on a 1% preparative agarose gel and electroeluted and pre- 

40 cipitated as described previously, then dissolved in water (20|al). For ligation of the synthetic gene, a mixture 
of vector DNA {2\i\ of the EcoRI - Sail fragment solution), synthetic gene {5\x\ of the aqueous solution described 
previously. 5X ligase buffer (6^l -250mM Tris pH7.6 50mM MgClj. 25% WA/ PEG8000, 5MM ATP, 5mM DTT 
exBRL) water (1 S^ii) and T4 DNA ligase {2\i\, 1 U^l) was incubated at 16**C for 4 hours. The DNA mix was used 
directly (either 1|il of neat ligation mix or 2|xl of ligation mix diluted 5X with water) to transform E. coli strain 

45 HB101. The DNA mixture (1 or 2^1) was added to competent E. coli HB101 cells (20fii. BRL) on ice and the 
mixture incubated on ice for 45 min then heat shocked at 42*'C for 45 seconds. After 2 mi n on ice, 100|il of 
SOC buffer (Bactotryptone 2%; Yeast Extract 0.5%; NaCI 10mM; KCI 2.5mm; MgCia. MgS04 20mm (10mm 
each); glucose 20mm) was added and the mixture incubated at 37**c for 1 hour. Aliquots of suspensions were 
plated onto L plates with SO^il/ml ampicillin. Transformants were screened for the presence of cloned synthetic 

50 gene by colony hybridisation analysis using standard methods described in "Molecular Cloning: A Laboratory 
Manual" by Maniatis et a[(Cold Spring Harbor) and in UK Patent Application No 8502605. A total of 100 colonies 
were streaked onto filters (Schleicher and Schuell), grown at 37^C for 20 hours, lysed and baked. The filter 
was hybridised at 65°C for 20 hours with a radioactive probe prepared from oligonucleotide sequence SEQ ID 
No 1 by use of a random-label kit (Pharmacia). Five colonies 1-5 giving a positive hybridisation signal were 

55 grown up in L broth at 37°C for 20 hours on a small scale (1 00ml) and plasmid DNA prepared by centrifugation 
in a caesium chloride gradient essentially as described in "Molecular Cloning; A Laboratory Manual" by Mani- 
atas et a[ (Cold Spring Harbor). 

The DNA was sequenced by the standard dideoxy chain-termination method as described by Sanger et al 
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in Proc. Nat. Acad Sci. USA 74. 5463-5467 (1977) using a Sequenase (Trade Mark) kit (United States Biochem- 
ical Corporation). Oligonucleotides SEQ 1 D No 1 9 to SEQ 1 D No 23 (as defined hereinafter and see Table 2) 
were used as sequencing primers. 



TABLE 2 



10 



CODE 



PRIMING SITE 



IS 



20 



25 



30 



35 



40 



45 



50 



55 



SEQ 


ID 


No 


19 


21A-234 top strand 


SEQ 


ID 


No 


20 


333-353 top strand 


SEQ 


ID 


No 


21 


375-395 bottom strand 


SEQ 


ID 


No 


22 


207-227 bottom strand 


SEQ 


ID 


No 


23 


69-93 bottom strand 



The plasmid DNAfrom clone 5 contained the DNA sequence shown in Figure 10. The plasmid was desig- 
nated pAG88 and was used to transfonn competent cells of the following E.coli strain HB101 and CGSC 6300 
(hereinafter also referred to as MSD 522) by standard procedures. 

EXAMPLE 5 

Preparation of M13mp18 template containing [Ser^^^T] human G-CSF gene 

The procedure for steps a) and b) in Example 4 was repeated with the following modifications: 
Oligonucleotides SEQ ID Nos 24, 25, 26 and 27 (as hereinafter defined) replace SEQ ID Nos 1, 2. 3 and 

4 (as hereinafter defined) respectively. 

c) Cloning of the gene for [Ser ^^27] human G-CSF into an expression vector 

The gene described above (see Figure 9 and SEQ ID No. 31) was cloned into plasmid vector plCI0020. 
This vector is a pAT153 based plasmid in which the 651 bp EcoRI-AccI region is replaced by a 167 bp EcoRI 
- Clal fragment (SEQ ID No.30) consisting of:- 

(1) a synthetic E. coll trp promoter and trp leader ribosome binding site 

(2) a translation Initiation codon 

(3) a multiple restriction enzyme recognition sequence derived from M13mp18, containing sites for Kpnl 
BamHI, Xbal, Sail. PstI, SphI and Hindlll 

(4) a synthetic transcription termination sequence 

The DNA sequence of this region is shown in Figure 5a. 

The PICI0020 expression vector was digested to completion with Kpnl (BCL) in 10mM Tris HCI (pH7,5), 
lOmM magnesium chloride. The DNA was precipitated with ethanol at -20°C from a solution containing 0.3M 
sodium acetate and then the 3'- sticky ends were removed by treatment with T4 DNA polymerase for 1 0 minutes 
at 37°C as follows:- 

DNA (Vg) in water (16^l) 

1 0X T4 polymerase buffer (2^1) 

0.33M Tris acetate pH7.9 

0. 1 M Magnesium acetate 

0.66M Potassium acetate 

5mM dithiothreitol 

Img/ml bovine semm albumin (BSA PENTAX fraction V) 

2mM dNTP mixture (1^l) 

T4 DNA polymerase (1^l; 2.5 units/|il BCL) 
Water {S0^\) was added and the mixture extracted with phenol/chloroform (100^l) and then with chlorofomi 
(1 00^l). The DNA was precipitated with ethanor(250^l) at -20X after addition of 3M sodium acetate (1 0^\) then 
drgested to completion with Sail (BCL) in 150mM NaCI, lOmM MgCIs and 1 0mM Tris HCI (pH7.5). The Kpn-blunt 
ended to Sail vector was purified from a 0.7% agarose gel and isolated by use of Geneclean (trademark) fol- 
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lowing the manufacturer's (Bio101. USA) recommended procedure. 

The synthetic gene was Isolated from the pSTP1 vectors as follows. The vectors were digested with Seal 
and Sail (both from BCL) in lOOmM Nad, lOmM MgCIa and 10mM Tris HCI (pH7.5). The 530 bp fragment was 
purified from a 0.7% agarose gel and isolated by use of Geneclean (trademark) following the manufacturer's 
5 (Bio101) recommended procedure. 

For ligation, a mixture of the Seal - Sail gene fragment (50ng) and the plCI0020 vector fragment (100ng) 
in 20^l of a solution containing 50mM Tris HCI (pH7.6). 10mM MgClj. 1mM ATP, 1mM DTT, 5% w/v PEG 8000 
and T4 DNA llgase (2 units; BRL) were Incubated at 16°C for 20 hours. The resulting mixture was used to trans- 
fomn competent E. coll HB101 cells (as supplied by BRL) as described herein. Transfonmants were selected 
10 for by growth on L-agar plates containing 50|ig/ml ampicillin and screened for the presence of the gene by col- 
ony hybridisation with a 32p labelled probe (SEQ ID No 24) as described herein. Plasmid DNA was prepared 
from 6 positively hybridising colonies, purified by centrifugation in a caesium chloride gradient and the sequence 
confirmed by dideoxy sequencing as described herein. 

The plasmid containing this gene was designated pICI 1080. 

IS 

d) Subcloning of an expression cassette containing a gene for [Seri7^7]G_csF into M13mp18. 

The following subcloning was effected to provide a starting point for preparation of the G-CSF derivatives 
detailed in Examples 3-8. 

20 Plasmid DNA from pICM 080 (purified by caesium chloride density centrifugation) was digested to comple- 

tion with EcoRI and Sal! (BCL) according to the manufacturer's instructions. The small EcoRI-Sall fragment 
containing the trp promoter and [Ser^^^^jQ.cSF gene was isolated from a 0.7% agarose gel by use of Genec- 
lean (trademark). This fragment was cloned into an EcoRI-Sall cut M13mp1 8 vector (DNA supplied by Amer- 
sham International; enzymes from BCL). The fragments were ligated together in 5x BRL ligation Buffer using 

25 BRL T4 DNA ligase (described previously). The ligation mix was used to transfect competent E. coli TG1 cells 
(made competent according to the calcium chloride method of Mandel and Higa described in Molecular Cloning 
- A Laboratory Manual - Maniatis et al Cold Spring Harbor). The transfected cells were suspended in TY top 
agar containing 2% X-Gal in DMF and 200ij.l log phase E. coli TG1 cells and were plated on 2x TY agar plates 
(TY top agar - 8g Bactotryptone, 5g Yeast Extract, 5g NaCI, 3.75g Bacto-agar in 500^1 sterile H2O; TY plates 

30 - 8g Bactotryptone, 5g Yeast-extract, 5g NaCl, 7.5g Bactoagar in 500 ml sterile H2O.) 

Four white plaques were picked into 4 x 2 ml 1% E. coll TG1 cells in TY broth (8g Bactotryptone, 5g Yeast 
extract, 5g NaCl in 500ml sterile H2O) aliquots and grown for 6 hours at 37**C. The 2ml cultures were split into 
0.5ml and 1.5ml aliquots. The bacteria were centrlfuged out of solution in an Eppendorf, (trademark) microfuge 
and the supematents were transferred to sterile eppendorf (trademark) tubes. The 0.5ml aliquots were stored 

35 at -20°C as phage stocks. The 1 .5ml aliquots were used to prepare single stranded DNA following the method 
in the Amersham International M13 sequencing handbook (see below). These DNA samples were then sequ- 
enced using oligonucleotides SEQ ID No 22, SEQ 1D No 23 and Ml 3 Universal sequencing primer. The reac- 
tions were canried out using the Sequenase kit (trademark) according to the manufacturers instructions. All 4 
clones had the correct DNA sequence for [ScM^-^tjg.qsf. 

40 

Large-scale single stranded DNA preparation 

For single stranded DNA preparations of between 200-500^g of DNA/ml, the method in the Amersham In- 
ternational "Oligonucleotide Directed Mutagenesis" was used. A detailed procedure is carried out as follows:- 

45 

LARGE - SCALE SINGLE STRANDED DNA PREP: 

A. Preparation of 1ml phage stock 

50 1 . Pick a single TGI E.coli colony from a glucose/minimal medium plate. Grow overnight in 1 0ml 2 x TY 

medium, shaken at 37**C. Add lO^tl to 20ml of fresh medium, and shake at 37°C for 3 hours. 

2. Inoculate 1ml 2 x TY medium In a 10ml sterile culture tube with lOOjil of 3 hour culture from step 1. 

3. Inoculate the 1ml culture with a recombinant plaque. 

4. Incubate for 4 hours with shaking at 37*'C. Transfer to a microcentrifuge tube. 

55 5. Centrifuge for 5 minutes at ambient temperature. Pour supernatent into a fresh tube. 

Store overnight at 4°C. Set up an overnight culture of TGI E.coli for the next stage. 
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B. Growth of 100ml phage culture. 

1. Jnoculate 100ml 2 x TY medium with 1ml of overnight TGI culture and shake at 37-C to an O.D 500 of 

2. Add the 1ml phage supernatent from A5 (above) to the 100ml culture. 

3. Incubate for 5 hours with shaking at 37°C. Transfer to centrifuge tubes. 

4. Centrifuge at 5000 x g for 30 minutes at 4°C, 

Lllf^n '"^ ^ ^'I^^" centrifuge tube. Take care not to carry over any cells (retain bacterial 

pellet for RF DNA preparation) 

LndlYl HouT^M? ^'^^ *° supernatent. Mix well and then leave to 

7. Centrifuge at 5000 x g for 20 minutes at 4°C. Dscard supernatant 

8. Centrifuge at 5000 x g for 5 minutes, and remove all remaining PEG/NaCI with a drawn out Pasteur 
pipette. 

?n'^^^"!S!f"'^ "^''^^ ""^^^^ transfer to a microcentrifuge tube (1 5ml) 

10. CenWuge for 5 minutes in a microcentrifuge to remove any remaining cells. Transfer the supematent 
to a fresh microcentrifuge tube. uK-oniaiBiu 

11. Add 200^1200/. PEG 12.5MNaCI to the supernatentmixwelithenleave to stand at ambient temperature 

Tor lo minutes. 

20 12. Centrifuge for 5 minutes, discard supernatent. 

13. Centrifuge for 2 minutes. Carefully remove all traces of PEG/NaCI with a drawn out Pasteur pipette 

14. Resuspend the viral pellet in 500^1 double distilled water. 

15. Add 200^1 phenol saturated with 10mM Tris HCI pH8.0, ImM EDTA. Vortex briefly 

16. Stand tube for 15 minutes at room temperature. 
25 17. Centrifuge for 3 minutes. 

18. Transfer supematent to fresh tube. 

19. Repeat steps 15-18. 

20. Add SOOfil chloroform and extract aqueous phase twice. 

21. Add 50^l 3M sodium acetate and 1ml absolute ethanoL Mix. 
30 22. Place in a dry ice and ethanol bath for 20 minutes. 

23. Centrifuge for 15 minutes. / 

24. Wash each pellet with 1ml -20«C ethanol. Pour off. 

di^DNr"" "^"^ ''^"^^ ^""^ '^'^^ '^''''^^^ "^^'^ P^^^^d"*-® yields 1 00-200Hg single stran- 
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EXAMPLE 6 



Preparation of pIC1 11 07 

The procedure described in Example 5 was repeated except as follows*- 
H bu;eMtS;Kor2Tou°.^^^^^^^ T4 polynucleotide kinase and digested with MstI, (10 units) in 1 X 
Following precipitation with ethanol. the 143 bp EcoRI-Mstll fragment was purified on a 1 Q% polyacrylamide 
?n Example 4^ electroelution from a gel slice and the DNA strands annealed as described 

in ir^*"^ T^T EcoRI-Mstll fragment described above was cloned into the plasmid vector pAG88 described 
o^. t- preparation. pAG88 (10ng) was digested with Mstll (20 units; BCL) in 1 X H buffer 

Sil.?^ ;h p °"p^.on'' "'^^ M sodium a^tate at 2^C 

I^fh ^S!^ I J ^^o." ^""-^ ''"^^^ °° 2 hours at SZ-C. Following precipitation 

^Ld!m«it H '^K h'k'^:'''"' ^""''^'^ °" ^ ^"'^ P-ifi^d using Geneclean 

^^Tl I''' manufacturer (Bio 101 . USA). Colonies containing the synthetic fragment were 

confirmed by screening with a radioactive probe prepared from oligonucleotide (SEQ ID No 1) and the correct 

for [Ser^ was designated pICI1 107. 

55 EXAMPLE 7 

Preparation of plasmid pICI 1239 

The site-directed mutagenesis procedure described below was employed using the mutagenic template 
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M13mp18 containing the gene for [Ser^'^'27]G.Q5p described in Example 5 or 6 (above). The mutagenic 
oligonucleotides used are designated SEQ 1D No 28 and SEQ 1D No 29 (as hereinafter defined). 

The triplet ACG in SEQ 1D No 28 serves to convert Gtn at position 11 to Arg and the first and last AGA 
triplets in SEQ ID No 29 serve to convert Pro at positions 65 and 60 to Ser. The mutagenesis was carried out 
5 as described below using SEQ ID No 29 in a single priming mutagenesis. This yielded a single plaque which 
incorporated the Pro 60 Ser and Pro 65 Ser changes. Single stranded DNA was prepared from this plaque as 
described in the mutagenesis procedure described below . This DNA was used as a mutagenic template in a 
single priming mutagenesis using SEQ ID No 28 as mutagenic primer. This yielded >100 plaques, 3 of which 
were screened by DNA sequencing as previously described. All 3 had the full set of changes incorporated. Dou- 
10 ble - stranded RF DNA was prepared from one of the plaques by following the procedure for large scale pre- 
paration of single stranded DNA (step d in Example 5) to step B5. The RF DNA was extracted from the bacterial 
pellet by the alkali lysis procedure of Bimboim and Doly (Nucleic Acids Research (1979) 7, 1513-1523) and 
purified by caesium chloride density gradient centrifugation as described in ''Molecular Cloning - a Laboratory 
Manual** by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Publication). The purified RF DNA was diges- 
ts ted with EcoRI and Sail in buffer H as described previously and the 61 9bp fragment, containing the trp promoter, 
ribosome binding site, translation initiation codon and gene for [Seri7,27]G.QSF isolated from a 0.7% agarose 
gel by use of Geneclean (TM). The fragment was ligated into an EcoRI-Sall digested plCI0020 vector, using 
a 2:1 molar excess of insert to vector, with T4 DNA ligase (BRL) and ligase buffer, essentially as described 
previously. The ligation mix was used to transform E.Coli strain HB101. Transfomnants were selected for by 
20 growth on L-agar plates containing 50|ig/ml ampicillin. Colonies were screened for the presence of the inserted 
DNA by restriction analysis of plasmid DNA prepared by the method of Birnboim and Doly as described in 
''Molecular Cloning - a Laboratory Manual" Sambrook, Fritsch and Maniatis (Cold Spring Harbor Publication). 
Plasmid DNA from a colony containing the expected 619bp EcoRI - Sail 
insert was used to transform E.coli strain MSD522 and designated plCI1239. 

25 

Site-directed mutagenesis protocol 



40 



The phosphorothioate method of Eckstein and co-workers was used: 

Taylor, J W et al Nucleic Acids Research (1985) Vol pp 8749-8764 
30 Taylor, J W et al Nucleic Acids Research (1 985) Vol pp 8765-8785 

Nakamaye, K et at Nucleic Acids Research (1986) Vol pp 9679-9698 
Sayers, J R et al Nucleic Acids Research (1988) Vol pp 791-802 

The procedure can be carried out using a kit supplied by Amersham International. The method is outlined 
35 below and incorporates changes to the original method with regard to the use of more than one mutagenic 
oligonucleotide and the incubation temperature for oligonucleotides of greater than 30 bases in length. 

1. Annealing mutant oligonucleotide to single stranded DNA template: 

Single stranded DNA template (lyg/ul) 5yl 

Phosporylated mutagenic oligonculeotide ( 1 . 6pmol/lyl) 2,5wl 
Buffer 1 3.5m1 
45 Water 6m1 

(Where two mutagenic oligonucleotides were used simultaneously. 2.5^1 (1 .6pmole/1 nt) of each phosporylated 
oligonucleotide was added to 5|il single stranded DNA template (1 ng/^il) in 3.5(il Buffer 1 and 3.5^1 water. Where 
3 mutagenic oligonucleotides were used 2.5^l (1.6pmol/nl) of each phosporylated oligonucleotide was added 
50 to 5^1 single stranded DNA (1^g/^l in 3.5^l Buffer 1 and 1^l water). The above ingredients were placed in a 
capped tube in a 70**C water bath for 3 minutes if the oligonucleotide was <30bases in length or in a boiling 
water bath for 3 minutes if the oligonucleotide was > 30 bases in length. The tube was then placed in a 37°C 
water bath for 30 minutes. 



55 2. Synthesis and ligation of mutant DNA strand: 
To the annealing reaction were added 
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MgCl2 solution 5\x\ 

Nucleotide mix 1 19fil 
(contains dCTP alpha S) 

water e^il 

Klenow fragment (6 units) 1.5hI 

T4 DNA ligase (5 units) 2\i\ 

The above ingredients were placed in a 16°C water-bath and left overnight. 

3. Removal of single stranded (non-mutant) DNA using disposable centrifugal filter units. 



To the reaction from Step 2 the following ingredients were added-- 
Water 170^1 
SMNaCI 30^1 

The 250nl sample was added to the top half of the filter unit and centrifuged at 1500 rpm for 10 minutes 
at room temperature in a SORVALL RT6000B bench top centrifuge using a SORVALL HI OOOB swing out rotor 
Sample passes through two nitrocellulose membranes which bind the single stranded DNA leaving the double 
stranded DNA to pass through to the collection tube below. 

100^1 of 500 mM NaCI were added and respun for 10 minutes to wash through any remaining RF DNA 
The following ingredients were added to the filtrate:- 
20 3M Sodium Acetate (pH6.0) 28^tl 
Cold Ethanol (-20°C) 700^l 

The mixture was placed in a dry ice and ethanol bath for 20 minutes and centrifuged in an Eppendorf micro- 
fuge for 1 5 minutes. The plellet was then resuspended in 1 0^il buffer 2. 

25 4. Nicking of the non-mutant strand using Nci I. 

To the reaction mix from step 3, was added 65^1 Buffer 3 and 8 units Nci I (1^1). The mixture was placed 
in a 37**C water bath for 90 minutes. 

30 5. Digestion of non-mutant strand using exonuclease III 

To the reaction mix from step 4 was added 
500 mM NaCI 12^| 
Buffer 4 10^1 
35 Exonuclease III (SOunits) 2^l 

The mixture was placed in a 37-C water bath and incubated for 30 minutes at 37°C, 50 units of exonuclease 
III will digest approximately 3,000 bases in 30 minutes). The mixture was then placed in a 70-C water bath for 
15 minutes to inactivate the enzymes. 

40 6. Repolymerisation and ligation of the gapped DNA. 

To the reaction mix from step 5 was added 
nucleotide mix 2 , 13hI 

MgCl2 solution 5^1 
45 DNA polymerase I (4 units) 1^| 
T4 DNA ligase (2.5 units) 1^1 

The mixture was placed in a 16°C bath for 3 hours. 



7. Transfonmation of competent host E^coH TGI cells with the DNA: 

300^1 of freshly prepared competent BcoH TGI cells (prepared following the method of Mandel and Higa) 
were transfomned with 20^\ of the reaction mix from step 6 (in duplicate) 

Iv^rSf a^^^^^ were,plated out in a lawn of log phase TG1 cells in TY Top agar-on TY plates and incubated 

The E coli strain TGI is freely available from for example the E. coli Genetic Stock Centre. Yale University 
utvo^l from Amersham International pic. Amersham-Place, Little Chalfont, Amersham, Buckinghamshire 
RPN 1523r^ ^"^^"^"^ their "in yifrq" mutagenesis system, oligonucleotide directed kit (Product code 
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EXAMPLE 8 

Preparation of plasmid pIC1 1295 (also referred to as pCG300 ) 

5 (a) Production of pCG54 from plCI1079 

plCI1079 is an ampicillin resistant. pAT1 53-derived plasmid containing the following elements between the 
EcoRI and Styll restriction sites:- 
(i) a CI857 from phage X; 
10 (ii) a XPl promoter; 

(iii) a synthetic ribosome binding site; 

(iv) a synthetic interferon a2 gene sequence; 

(v) a synthetic transcription terminator sequence, derived from phage T4, between the Sal! and Sty! res- 
triction sites. 

15 The DNA sequence of this transcription terminator is shown in Figure 1(b) and SEQ ID No. 37. 

plCI1079 is illustrated in Figure 6. 

plCI1079 has been deposited under the Budapest Treaty, at the National Collections of Industrial and Marine 
Bacteria Limited (NCIMB), 23 St. Machar Drive, Aberdeen, AB2 1RY, Scotland, UK. (NCIMB No 40370, date 
of deposit 19 February 1991). 

20 pCG54 was constructed in order to make available an expression vector containing the same promoter, ribo- 
some binding site and transcription terminator sequences as above, ie: Xp^ RBS7 and T4, but lacking gene 
sequence encoding for production of a specific protein. Such a construct would provide the facility of a basic 
expression vector containing essential elements allowing transcription and translation for production of any pro- 
tein of interest which could be introduced into this vector by subsequent cloning events. 

25 Construction of the vector was initiated by restriction endonuclease cleavage of pIC1 1 079 at its respective 

EcoRI and Sail sites. This cleavage step released a vector fragment containing the plCI1079 backbone com- 
plete with genes for plasmid replication and antibiotic resistance functions, plus the T4 transcription terminator 
sequence. The fragment was isolated by agarose gel purification steps using Geneclean for final purification 
of the DNA fragment 

30 To this vector fragment a second smaller DNA fragment of approximately 1.2Kb in size was introduced. 

This second fragment may be obtained, for example by DNA synthesis or by site directed or PCR mutagenesis 
of the small EcoRI-Sall restriction fragment obtained from plCI1079 as described above. This second fragment 
contained exactly equivalent promoter and ribosome binding site sequences as originally present in plCI1079 
and additionally had EcoRI and Sail sites available at its 5' and 3' temriini respectively, so providing compatible 

35 tenmini for ligation to the plCI1079 fragment. A ligation reaction in the presence of Gibco-BRL enzyme T4 DNA 
ligase and its respeictive buffer, resulted in the formation of the construct pCG54. 

Clones containing this construct were originally isolated following transfomnation of an aliquot of the ligation 
reaction mixture into E.coli competent cells of strain HB101 . 

The Gonstmct pCG54 recovered was 3.682Kb in size and contained essential features as outlined on the 

40 map featured in Figure 11. 

(b) Production of pCG61 from pCG54 (also referred to as plCI54) 

Synthetic oligonucleotide sequences were designed so as to include both the natural sequence for the 
45 T7A3 promoter and also a sequence which would provide an effective translation initiation region to enable 
conrect processing of any polypeptide gene sequence cloned adjacent to it. A suitable candidate sequence for 
this latter region was identified as RBS1, the trp ribosome binding sequence. Therefore two complimentary 
oligonucleotides identified as SEQ ID No.38 and SEQ ID No.39 were synthesized to generate a double stranded 
DNA linker incorporating the T7A3 promoter and RBS1 sequences , 
50 Oligonucleotides were prepared as 84mers by the standard protocol using an ABI gene synthesizer. They 

were designed so that in the double stranded fonm the synthetic fragments would have restriction endonuclease 
sites EcoRI and Kpnl at the 5' and 3' ends respectively, Due to their length the oligomers could not be purified 
by means of HPLC and purification was undertaken by means of acrylamide gel electrophoresis using a 10% 
acrylamide: 7M Urea gel. 

55 Prior to purification, the oligomers were first checked on a sizing gel to ensure that not only are they of the 

correct size but that also the samples prepared contain as their greatest proportion the oligomers required and 
not a high contaminating proportion of smaller secondary oligonucleotides which result as by-products of synth- 
esis. 
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The acryfamide gels were prepared by standard methods with ammonium persulphate and N.N N' N'-tet- 
ramethylethylenediamine used as catalysts for gel polymerisation. 

Sizing of the oligonucleotides required that they could be visualized after electropohoresis. It was therefore 
necessary to radioactively label the samples using 32p. jhis made it possible to assess sample quality following 
electrophoresis by way of autoradiography. 

Oligonucleotide samples were supplied in a crude fonm unphosphorylated. This factor was made use of 
for radiolabelling purposes in that the samples could be 'hof labelled at the 5' termini by phosphorylation using 
the enzyme T4 polynucleotide kinase. 

Oligomers were provided from synthesis in an unphosphorylated form and so after purification each 
oligomer was individually subjected to a phosphorylation reaction in which ATP was used to phosphorylate the 
5' end of each molecule in the presence of T4 polynucleotide kinase, (see Molecular Cloning: A Laboratory 
manual 2nd Edition. Sambrook. Fristch and Maniatis, p 5.68-5.71). Once phosphorylated the two complimen- 
tary oligonucleotides were annealed together to fonm the double strand DNA duplex containing the T7A3 oro- 
moter and the RBS1 sequence. 

The vector molecule pCG54 was cleaved with restriction enzymes EcoRI and KpnI. On restriction digestion 
2.3kb vector fragment and a l.lkb fragment containing the Xp^. promoter and RBS1 sequence are generated 
This doning Step is planned to replace the Xp^ -RBS1 sequence by EcoRI to Kpni synthetic fragment comprising 
the T7A3-RBS1 sequence. The 2.3kb vector fragment resulting from digestion of pCG54 was purified by the 
usual protocol using agarose gel electrophoresis and Geneclean methodology for removal of DNAfrom agarose 
20 fragments. 

The 84bp EcoRI-Kpnl synthetic fragment was ligated into the vector molecule prepared above and the 
hgated DNA used to transfonn ExoH HB101 cells. Selection of positive recombinant clones was by ampicillin 
resistance. Following transformation a number of colonies containing recombinant plasmid were selected for 
screening purposes. 

25 The synthetic fragment incorporated into the vector during cloning was of a size (84 mer) such as to make 

restnction analysis of recombinant plasmid DNA samples Inappropriate as a simple screening method. Inserts 
of such a small size are not readily apparent on agarose gel electrophoresis. The fragment itself contains no 
internal restriction endonuclease cleavage site which could be diagnostic of its presence. Initial screening of 
reoornbmant clones was therefore by the method of colony hybridisation (see Grunstein and Hogness Proc 
Natl Acad. Sci 72, 3961 (1975)). Nitrocellulose filters containing immobilized plasmid DNAfrom the recombinani 
clones were hybridised against a probe prepared by random radiolabelling of the synthetic annealed oligonuc- 
leotide SEQ ID No. 38 and SEQ ID No. 39 . The DNA was labelled using a32P-dCTP and incubatton with Klenow 
polymerase at 37»C for 2 hours. Recombinant colonies which generated a positive hybridisatfon reaction were 
selected for plasmid DNA preparation. Plasmid DNA was prepared in each case by a relatively large scale 
35 method incorporating CsCI gradient density centrifugation to ensure purity see " Molecular Cloning - A laborat- 
ory manual "second edition. Sambrook Fritech and Maniatis (Cold Spring Harbor Laboratory. 1 989) pi 42-1 52 
Preparation of DNA by such a method ensures high quality material suitable for use in subsequent cloning mani^ 
pulations and sequence analysis. 

All plasmid DNA isolated from recombinant clones was included in a secondary screen by sequence 
40 analysis, to ensure that the oligonucleotide sequence at the cloning junctions and of the T7A3-RBS1 fragment 
Itself was absolutely con-ecL The sequencing protocol used was that of Sequenase and the sequencing primer 
selected for use was for example pBR322 UP (pBR322 universal primer). Sequencing was effected using the 
Sanger dideoxy chain tenmlnation sequencing technique. 

Clorjes having the correct sequence were designated as the new expression construct pCG61. and contained 
the T7A3 pronroter, RBS1 sequence and the T4 temiinator sequence (see Figure 8). 
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EXAMPLE 9 - PREPARATION OF RICIN A 



The following illustrates the use of plasmid plCI0042 in the preparation of ricin A. A DNA sequence coding 
50 for the ricin A was inserted into plasmid plCI0042 such that it was under the control of the trp promoter DNA 
lll"^"^^^°''^;''''^^'^'^^^^'*^^<i'forexarop^^^ in EP 145.111; Lamb. I.F.etal-. Eur. J. Biochem.. 1985 148 
265-270; and O Hare. M. etal., FEBS Letts.. 1987. 216. 73-78. The following describes the preparation oil^- 
eral intermediate stages in the derivation of the particular vector used to prepare recombinant ricin A. 

55 9.1 Synthetic oligonucleotides 

Synthetic oligonucleotides were used to introduce specific DNA sequence alterations of the ricin gene All 
oligonucleotides subsequently described were prepared on an Applied Biosystems 380A DNA synthesiser from 
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5'-dimethoxytrityl base-protected nucleoside-2-cyanoethyI-N,N- 

diisopropylphosphoramidites and protected nucleosides linked to controlled-pore glass supports on a 0.2 micro 
mol scale, according to protocols supplied by Applied Biosystems Inc. 

Each oligonucleotide, after cleavage from the solid support and removal of all protecting groups, was dis- 
5 solved In water (1ml) and a measurement of absorbance at 260nm used to determine concentration. 

9.2 Enzymes 

A variety of restriction endonucleases and DNA modifying enzymes were used in the manipulations des- 
10 cribed below. These were purchased from one of a number of suppliers (Amersham International, Bethesda 
Research Laboratories, Boehringer Mannheim or New England Biolabs) and used in accordance with the man- 
ufacturers instructions with respect to reaction conditions. 

9.3 Construction of the pICI exprerssion vectors 

15 

9.3 a) PICI0020 

As mentioned in Example 5(c), plasmid vector plCI0020 is a pAT153 based plasmid in which the 651 bp 
EcoRI-AccI region is replaced by a 167 bp EcoRI - Clal fragment consisting of:- 
20 (1) a synthetic E. coli trp promoter and trp leader ribosome binding site 

(2) a translation initiation codon 

(3) a multiple restriction enzyme recognition sequence derived from M13mp18, containing sites for Kpnl, 
BamHI, Xbal, Sail, PstI, SphI and Hindi!! 

(4) a synthetic transcription termination sequence 

25 The construction of a plasmid vector containing a synthetic trp promoter sequence is published (Windass 

et a[ NucAcids Res. 10 p6639-6657, 1 982). A promoter fragment was isolated from such a vector after digestion 
with the enzymes Eco RI and Hpa l and purification of the appropriate band from an agarose gel by electro-elution 
(in "Molecular Cloning - A Laboratory Manual", Maniatis, Fritsch and Sambrook, published by CSH laboratory, 
second edition 1989 and hereinafter referred to as "Maniatis"). 

30 A pair of complementary synthetic oligonucleotides were prepared which would ligate to the Hpa ! end of 

the promoter fragment providing the natural trp leader ribosome binding site, a translation initiation codon and 
a 3' Kpn ! cloning site. These oligonuleotides were mixed in equimolar concentrations and allowed to anneal 
by heating to 100°C followed by slowly cooling to room temperature. 

The promoter fragment and annealed oligonucleotides were then ligated and the appropriate band isolated 

35 from a polyacrylamide gel by electroelution. This fragment was then ligated with an Ml 3mp1 8 vector derivative 
containing the tr^ attenuator sequence (generated from synthetic oligonucleotides) cloned into the Hindlll site 
and introducing an additional Cla l restriction site 3' to the attenuator. The ligated DNA was transfected into E.coli 
strain JM109 (Yanisch-Perron et al Gene, 33, p103, 1985) made competent by the CaClj method (Maniatis. 
chapter 1p82). After plating out and incubation of the plates, plaques were screened by the method of Benton 

40 and Davies (Maniatis, chapter 4p41) using a 32p labelled probe generated by nick translation of the EcoRI- Hpa l 
promoter fragment isolated previously. Single stranded DNA was prepared from positively hybridising plaques 
by a standard method (Maniatis, chapter 4p29) and sequenced using the Ml 3 universal primer and the Sanger 
dideoxy chain termination method as provided in kit form by a number of suppliers eg. Sequenase (United 
States Bioscience). 

45 RF DNA was prepared from one isolate in which the promoter/ribosome binding site/attenuator sequence 

had been confinmed. This DNA was digested with Eco RI and Cla l and the appropriate fragment isolated from 
a polyacrylamide gel as above. Plasmid pAT153 was digested with the enzymes Eco RI and AccI and ligated 
with the isolated promoter fragment. Ligated DNA was used to transfomn competent E.coli HB101 (Bethesda 
Research Laboratories) and ampicillin resistant colonies selected. 

50 Plasmid DNA from several clones was prepared and DNA sequence derived from the region between the 

EcoR I and Clal sites. One clone confirmed as containing the correct promoter/attenuator region was named 
PICI0020. 

This construction is outlined in Fig.12. 

55 9.3 b) PICI1079 

As mentioned in Example 8(a), plasmid vector pICII 079 is an ampicillin resistant^ p ATI 53-d erived plasmid 
containing the following elements between the EcoRI and Styl restriction sites:- 
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(i) a CI857 gene from phage X.; 

(ii) a XP\_ promoter; 

(iii) a synthetic ribosome binding site; 

(iv) a synthetic interferon a2 gene sequence; 

5 (v) a synthetic transcription terminator sequence, derived from phage T4, between the Sail and Styl res- 

triction sites. The DNA sequence of this transcription tenmlnator is shown in Figure 1b. plCI1079 is illus- 
trated in Figure 6. 

plCI1079 has been deposited under the Budapest Treaty. The deposit has been made at the NCIMB. 23 St 
Machaer Drive, Aberdeen, Scotland. The date of deposit was 19 February 1991 and the number is NCIMB 
10 40370. 

This plasmid was used to provide a source of the T4 transcription terminator for the generation of the ricin 
A expressing clone plCI1185 (see 9.5.d below). The starting point for the generation of this plasmid was 
plCI1043. plCI1043 is a plasmid based on plCI0020 (see 9.3.a above) in which an expression cassette con- 
taining a kPi promoter and interferon a2 gene (Edge et al Nuc.Acids Res. 11 p6419-6435, 1983) is present 
15 between the Eco Ri and Sai l sites. 

A complementary pair of oligonucleotides was synthesised to generate the transcription terminator from 
gene 32 of bacteriophage T4 with 5' Sa[l and 3' Sph l cohesive ends. This fragment was ligated with a plasmid 
fragment isolated from plCI1043 which had been digested to completion with Sa[l and Sph l. The intermediate 
plasmid thus produced (plCI1078) contained both the T4 terminator and fr^ attenuator sequences in tandem. 
20 ' Asecond pair of complemetary oligonucleotides was then used to replace the trp attenuator sequence (and 
remaining part of the tetracycline resistance gene) by insertion between the Sph l and St^l sites of p!CI1078. 
A unique Bam HI site was introduced within this synthetic fragment. 

These manipulations are outlined in Fig. 13. 

25 9.4 Generation of a ricin A expressing clone 

9.4 a) Preparation of pUCSRA plasmid DNA 

A clone (pUCBRA) was generated which contains the DNA coding for ricin A. This clone contains A-chain 
30 cDNAfrom base number -74 in the leader sequence through to the Bam HI site within the B-chain (base number 
857) according to the published cDNA sequence (LambJ.F.. Roberts,L.M., Lord.J.M. Eur.J.Biochem , 1985. 
148. P265-270) in plasmid pUC8 (Vieira.J and Messing, J. Gene. 19, p259, 1982). In addition, site-directed 
mutagenesis has been used to generate a translation termination codon immediately 3' to the final codon of 
mature ricin A (as reported in O'Hare. M et a[ FEBS Letts. 1 987, 216 , p73-78). The entire A-chain coding region 
35 is included in a Bam HI fragment from this clone. 

A small quantity of pUCSRA plasmid DNA was obtained from the originators. For future stocks, a dilution 
of this DNA was used to transform into E.coli DH5a competent cells (Bethesda Research Laboratories) and 
an ampicillin resistant transformant selected. Plasmid DNA from this clone was prepared by a modified Birn- 
boim-Doly procedure (Maniatis, chapter 1 p25). Samples of this DNA were digested with Bam HI and BanI sepa- 
40 rately and compared to corresponding digests of the original sample of DNA after electrophoresis on an agarose 
gel. No differences In restriction pattern were observed and, on this basis, the two DNA samples were assumed 
to be identical. 

9.4 b) Sub-cloning into M13 

45 

Bam HI digests of pUC8RA plasmid DNA and RF (replicative fonm) DNA from the phage M13 strain K19 
(Anglian Biotechnology) were "shotgun" ligated using standard conditions (Maniatis, chapter 1p68). Control 
ligations were also performed. The ligated DNAs were used to transform E.coli strain TG1 (Gibson. 1984/Ang- 
lian) made competent by the CaClz method (Maniatis, chapter 1p82). 

50 The transfomnation frequences indicated efficient ligation and recombinant phage were expected in the pro- 

geny. Recombinant phage were predicted to produce clear plaques on IPTG + X-gal (BRL) containing plates 
due to disruption of the lacZ (p-galactosidase) gene. Wild type phage produce blue plaques due to conversion 
of the X-gal by p-galactosidase. 

Several clear plaques were picked for single strand DNA preparation. Direct gel electrophoresis of lysed 

55 phage suspensions indicated that one phage clone contained a sizeable insert which was confirmed by 
sequencing to be the ricin A-chain coding sequence. Only 182 bases of the mature ricin A coding sequence 
were confirmed but this was taken as sufficient evidence for the presence of the entire ricin A gene. This clone 
was named M13K19RA 
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9.4 c) Mutagenesis of M13K19RA 

To generate a Kpn l site, compatible with pICI expression vectors, at the start of mature ricin A, the following 
changes (underlined) are necessary:- 

5 

SEQ.ID.NO. A4 

10 

5' GATAACAACATATTCCCCAAA 3' 

Ricin leader sequence Mature ricin A > 

15 

Changed to: 
SEQ,ID,No.A5 

5 ' G AT AACAACATGGT ACCCAAA 3 ' 

I Kpn l 
I 

25 Translation initiation 

and result in an ATG codon overlapping a Kpn l site. A Kpnl fragment containing ricin A can be excised from 
the mutant and inserted into the ICI expression vector series . Two N-terminal amino acid modifications are 
made (ile- phe to met-val ). 

30 The single stranded DNA prepared from M1 3K1 9RA was the template for the mutagenesis step for each 

mutation strategy. A single oligonucleotide (DTR1 6) introducing all the mutational changes for this strategy was 
synthesised. 

DTR16 5' AACAACATGGT ACCCAAACAA 3' SEQ.ID.NO. 46 

35 

Several protocols exist for the introduction of specific DNA sequence changes by site directed 
mutagenesis. The procedures outlined below were achieved using the method of Eckstein et al (Nuc. Acid Res., 
1985. 13 p8749-8764 and 1986. 14. p9679-9698) as provided in kit form (Amersham International) and used 
in accordance with the manufacturers instructions. 

40 The principle of this method is to prime the single-stranded DNA template with the mutagenic oligonuc- 

leotide and synthesise the complementary strand incorporating dATPaS in place of dATP. Using this nucleotide 
results in the formation of phosphorothioate bonds which are not cleaved by certain restriction enzymes (eg. 
Neil ). After synthesis of the second strand. Nei l is used to nick the parent strand and exonuclease III added to 
digest back past the mutation point. DNA polymerase I then allows resynthesis of the parent strand. Conse- 

45 quently, the mutagenic oligonucleotide acts as a template for resynthesis and the mutation is introduced into 
both strands prior to transformation. Mutation frequencies up to 96% of the total progeny are claimed and 
screening is simply performed by picking plaques at random for sequence analysis. 
In our experiments 4 out of 4 plaques picked were correctly mutated. 

Having chosen one mutant (MRA16), RF DNA was prepared and checked for the presence of the newly 
50 generated restriction fragment ie Kpn l. 

9.4.d) Cloning, Expression and Initial Characterisation 

The pICI series of expression vectors (see section 5) can accept DNA fragments cloned into a unique Kpn l 
55 restriction site adjacent to the Tr^ promoter. The Kpn l site overiaps the translation initiation codon (ATG) which 
is situated 8bp downstream from the Shine-Dalgamo site (AGGA) of the promoter. 

Having verified the sequence of MRA16. a large scale (-5^g RF DNA) Kpn l digest was performed and the 
relevant ricin A coding DNA fragment isolated from an agarose gel (Nu-Sieve GTG agarose. FMC Bio-products) 
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by phenol extraction of an excised gel slice according to the manufacturer's protocol. 

plCI0020 (see 9.3a) was digested with Kpn l and then dephosphorylated using calf intestinal alkaline phos- 
phatase (CIP - Boehringer Mannheim). The latter treatment prevents recircularisation of the vector upon ligation 
which would lead to a high proportion of parentals In the transformation progeny. 
5 Ligations were set up with ratios of plasmid vector to Isolated fragment from 8:1 (w/w) to 1 :3 for the various 

strategies. Control ligations to test the effectiveness of phosphatase treatment, ligase activity etc.. were 
included. The ligation conditions were appropriate for the source of T4 DNA ligase used (New England Biolabs 
or Amersham). Reactions were generally incubated at 15^*0 overnight. 

Fifty percent of each ligation (5^1) reaction was diluted to 100^l with 1 x TNE (50mM Tris, 50mM NaCI, 
10 ImM EDTA) and 200^1 of competent E.coll DS41 0 added. After a standard transformation protocol (Maniatis! 
chapter 1p74). the cells were plated onto L agar plus streptomycin (25^g/ml) and ampicillin (lOOMg/ml) and incu- 
bated at ZT^C overnight. E.coli DS410 has a chromosomal streptomycin resistant gene. 

The transformation plates were examined after incubation. In general. 5 to 10 times more colonies were 
seen in ligations compared to controls without ligase. In some cases, little difference in the number of colonies 
15 produced in the presence or absence of ligase occurred indicating incomplete digestion of the vector or poor 
ligase activity. 

Transfonmants. plus the relevant controls were picked onto nitrocellulose filters placed on L agar plates for 
hybridisation screening (based on the method of Grunstein and Hogness as described in Maniatis, chapter 
1p98). After incubation, the colonies were lysed in situ using 10% SDS and 1M NaOH, neutralised using 1M 
20 Tris (pH 7.5) and dried under vacuum at 80^C for 2 hours. 

Hybridisation probes were generated by labelling of the mutational oligonucleotides using T4 polynuc- 
leotide kinase. The filters were probed at room temperature and then washed in stages up to 55-65°C to remove 
non-specifically bound counts before autoradiography. Specific hybridisation indicated putative clones contain- 
ing ricin A DNA. 

25 Small scale DNA preparations (by the methods of Holmes and Quigley or Birnboim-Doly as specified in 

Maniatis. chapter 1 p25) were made from positively hybridising clones. The DNAs were digested with the relev- 
ant restriction enzymes eg. Kginl and EcoRI/Bglll. and analysed by electrophoresis on agarose gels. Vector 
DNAs and mutated RF DNAs were cut with the same enzymes to demonstrate the fragment sizes expected 
for the conrect clones. 

30 Larger scale plasmid DNA preparations (Birnboim-Doly) of each clone were used for more detailed res- 

triction analysis, eg. Clal. Hindlll, Bam HI, EcoRI/BglH Kpn l. and ScaL On agarose gels, these digests showed 
the size of fragment inserted, an indication of its orientation and the gain of some unique ricin A-chain enzyme 
sites. 

35 9.4.e) Expression studies 

The clones positively identified by hybridisation and restriction screening were tested for expression of ricin 
A by SDS-PAGE analysis of total cell lysates. The standard conditions for expression studies were:- 

1 ) Inoculate 1 0ml of L-broth + antibiotic(s) with a single colony and grow at 37**C overnight with gentle shak- 
40 ing. 

2) Take 750^1 of the L-broth ovemight and pellet the cells in a microfuge (1 min at 6500 rpm). 

3) Resuspend pellet in 300^\ M9 medium (Maniatis. appendix A.3) + 0.02% casein hydrolysate + 0.2% glu- 
cose + 50|ig/ml thiamine and inoculate into 10ml of same. 

4) Incubate for 7 hours or overnight at 37**C with gentle shaking. 

45 5) After incubation, measure OD540. pellet the cells and resuspend to OD540 = 1 0 per ml in Laemmli sample 

buffer (Maniatis. chapter 18p53). Boil for 15 minutes. 

6) Load 20^l of total cell lysate on an SDS polyacrylamide gel. electrophorese. stain with Coomassie blue, 
^ destain and visualise. 

Of the clones studied by SDS-PAGE. only 1 showed an additional band with 
50 equivalent molecular weight of -29KD (equivalent to that estimated for unglycosylated. mature ricin A). Gel 
scans indicated the expression level to be in the range of 5-10% of total cell protein. This clone was named 
PICI1102. 

The construction of pICI1 102 is outlined in Fig.14. Results of expression studies are shown in Figs.15 and 

16. 

55 

9.4 f) Western transfers and immunodetection of recombinant ricin A 

Authenticity of recombinant ricin A-chain protein, initially observed by Coomassie blue staining of SDS- 
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polyacrylamide gels, was confirmed by Western blotting. The protein bands were transferred to nitrocellulose 
filters and detected using a ricin A specific antibody followed by peroxidase labelled antiglobulins. 

1 5% SDS-PAGE gels were run overnight at 8mA then equilibrated for at least 30 minutes in transfer buffer. 

Protein bands on the gels were then transferred to nitrocellulose membranes (Hybond-C. Amersham) 
5 electrophoretically in a Bio-Rad Trans Blot apparatus at 70V for 3 hours. The filters could be stored, after drying, 
in sealed plastic bags at -20''C. 

Ricin A.1 was a polyclonal antibody raised in rabbits against a synthetic peptide fragment of ricin A. Pre- 
liminary studies showed good affinity for ricin A but considerable cross-reactivity with many E.coli proteins. To 
overcome the high background caused by this cross-reactivity the antibody was pre-incubated with an E.coli 
10 lysate. 

Thus, a 10ml L-broth ovemight culture of E.coli strain DS410 was centrifuged at 4000 rpm for 10 minutes 
to pellet the cells. The pellet was resuspended in 5ml of bacterial buffer and sonicated at4-6^for 6x 10 second 
bursts with 30 seconds cooling intervals on ice. 

0.5ml of sonicate was then mixed with 0.5ml of ricin A.1 antiserum and incubated at room temperature for 
15 90 minutes. Cell debris was spun down at 13000 rpm for 5 minutes and the supernate stored at -20°C. 

The nitrocellulose filters from Westem transfers were blocked by incubation ovemight at room temperature 
in 5% BSA-PBS/Tween. (PBS Tween = 5ml Tween 20 per 1 litre of PBS). 
Washed 3x3 minutes in PBS/Tween. 

Incubated 2 hours (or overnight) at room temperature with a 1/4000 dilution of "blocked" Ricin A.1 antibody in 
20 0.5% BSA-PBS/Tween. 

Washed 3x3 minutes in PBS/Tween. 

Incubated 1 hour with a 1/1000 dilution of goat anti rabbit antiserum in 0.5% BSA-PBS/Tween at room tenv 
perature. 

Washed 3x3 minutes in PBS/Tween. 
25 Incubated 1 hour with a 1/5000 dilution of rabbit peroxidase anti-peroxidase antiserum in 0.5% BSA/PBS/Tween 
at room temperature. 

Washed 3x3 minutes in PBS/Tween. - 
Developed by immersion in a solution of 4-chloronaphthol (60mg) In 20ml methanol made to 120ml with PBS 
and containing 12|il hydrogen peroxide. The membrane was removed from the solution as soon as bands were 
30 visible, dried and photographed. 

A typical Western blot analysis is shown in Fig. 17. 

9.4 g) Biological assay for recombinant ricin A protein 

35 The aim here was to establish conditions under which samples generated during the ricin A-chain purifi- 

cation from E.coli cells could be tested for biological activity in a cell-free in vitro protein synthesis assay. 

Rabbit reticulocyte lysates were prepared according to the method of Allen and Schweet (J Biol Chem 
(1962), 237, 760-767). The assay demonstrates inhibition of protein synthesis in a cell-free system by a lack 
of incorporation of ^'♦C-labelled leucine into newly synthesised protein. 

40 

9.4 g.i) The assay protocol 

Stock solution: ImM amino acid mix minus leucine. 
A solution containing all L-amino acids at ImM except leucine (adjusted to pH7.4 with NaOH and stored at - 
45 70<»C). 

Soln. A 

40mM Magnesium acetate 
2M Ammonium acetate 
0.2M Tris 

50 (pH 7.4 with HC1 , stored 4^C) 

Soln. B 

ATP (Sigma A5394) 246mg/ml 
GTP (Sigma G8752) 24.4mg/ml 
Assay mix: 1ml Amino acid mixture 
55 1ml Soln. A 

0.1ml Soln. B 

103mg Creatine phosphate 
Img Creatine kinase 
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510^l H2O 

eOO^il (eo^iCi) L-^^C-leucine (New England Nuclear. NEC-279E) 
Reaction mix: Test sample 25|il 
Assay mix 12.5^l 
5 Rabbit reticulocyte lysate 25^l 

Blank solution was 2mg/ml BSA in PBS 
All assays were done in duplicate 
12.5^il of assay mix placed in sterile glass tubes 25^1 of BSA In PBS added to each of first four tubes for 
blanks 25^l of test samples added to rest of tubes 1ml 0.1 M KOH added to first two tubes (background blank) 
10 Tubes equilibrated to 28^C in a water bath 25^l of rabbit reticulocyte lysate (allowed to thaw from liquid nitrogen 
temperature) were added to each tube at 20 second Intervals. When first tube had incubated for 12 minutes, 
1ml 0.1 M KOH was added to each tube again at 20 second intervals to allow all tubes to have 12 minutes incu- 
bation. Two drops of 20% hydrogen peroxide were added to each tube followed by 1ml of 20% TCA. 
Tubes were mixed and allowed to stand for at least 1 hour, or overnight, at 4*>C. The precipitates were filtered 
15 on to 2.5 cm GFC discs, washed with 3 x 4 ml of 5% TCA, transferred to scintillation vials and 10ml scintillant 
(Ready-Solv. MP, Beckman) added. After 1 hour the vials were shaken and counted. 

9.4 g.ii) Establishment of technique for use with E.coli lysates 

20 1 0ml L-broth overnight cultures were grown at 37''C. 400^il aliquots were pelleted at 1 3000 rpm for 30 sec- 

onds and most of the supernate decanted. 

The pellets were subjected to 2 rounds of rapid freezing in dry ice/EtOH followed by thawing at 37°C. 12^l 
of 25% sucrose in 50mM Tris HC1 pH8.0 were added followed by 4^1 of a lOmg/ml solution of lysozyme. 
After incubation on ice for 15 minutes, 8^1 of 0.25M EDTA were added and incubation continued for 15 
25 minutes. Lysis was brought about osmotically by diluting the samples to 400^1 with water. This procedure pro- 
duced viable cell counts of 80-100 per ml. 

When a 25^1 aliquot of this lysate was added into the assay reaction mix, the level of incorporation of i^c- 
leucine into newly synthesised protein was -1 0% of the blank without lysate. This was a similar level of inhibition 
to that produced by 8ng/ml ricin A. Dilutions of the E.coli lysate were then prepared and the assay repeated. 
30 The result clearly showed that a minimum 1 6-fold dilution was necessary to reduce the effect of the lysate to 
equal that of the blank. 

In order to be as confident as possible that lysis of E.coli and E.coli lysates would not compromise ricin A 
toxicity. 2 control assays were performed. The first added plant-derived ricin A to a 16X diluted E.coli cell pellet 
so as to give a final concentration of 8ng/ml in the assay mix after cell lysis. Both these controls showed no 
35 deleterious affect from the lysates or the lysis procedure on the Inhibitory action of ricin A. 

These techniques were used to verify the synthesis of biologically active, recombinant ricin A from pIC1 1 1 02 
and the clones described below. 

9.4 h) DNA sequence analysis 

40 

Plasmid DNA sequencing was used to analyse pICI1 1 02. The protocol chosen was modified from Zagursky 
etal (Gene Analysis Techniques Vol 2, N° 5) and involves alkaline denaturation of double stranded plasmid 
DNA prior to primer annealing and sequencing by a standard procedure such as that provided in kit form by 
several suppliers, eg. Sequenase (United States Bioscience). By using an oligonucleotide to prime at the 3' 
45 end of p-lactamase and several A-chain internal primers, sequencing both strands of the promoter and ricin A 
gene was possible. 

The initial sequencing data revealed an unexpected result in that an additional KenI fragment was present 
between the promoter and ricin A coding sequence, ie: 

50 



55 
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SEQ.ID.NO. hi 



5 ' AAAAAGGGTATCGACATGGTACCCGGGGATCCACCTCAGGGTGG 

to 



Kpn l 



t5 

TCTTTCACATTAGAGGATAACAACATGGTACCCAAACAATAC 3 ' 

The additional Kpn l fragment has come from M13K19RA and contains restriction enzyme sites plus the 
part of the ricin leader sequence cioned from pUCSRA. The 5' region of the ricin A chain contains the base 
20 changes induced during mutagenesis. 

Study of this sequence reveals that the first translation initiation codon (ATG) is out of frame with that the 
ricin A coding region. Also, there is an in-frame termination codon (TAG) prior to the ricin A initiation codon 
and a putative Shine-Dalgamo sequence (AGGA) which could re-initiate translation from the second ATG. - 
Subsequent studies revealed that, surprisingly, this additional DNA fragment conferred a beneficial advan- 
25 tage with respect to the accumulation level of ricin A-chain in E.coli when compared to clones from which it 
had been excised. 

The complete DNA sequence of the ricin Agene contained in plCI1102 is given in Fig. 18. 

9.5. Generation of subsequent ricin A expressing clones 

9.5 a) Mutation of Ricin-A done plCI1102 to allow subcloning 

To subclone the two Kpn l fragments from the fortuitously generated pIC1 1 1 20 in the correct orientation for 
ricin-A expression would be difficult. Consequently, we planned to alter the internal Kpn l recognition site by a 
35 single base substitution (A to T). This would prevent Kpn l cleavage at this site and allow the subcloning of a 
single K^nl fragment into the range of tr^/RBS vectors. By substituting the adenine of the Kpn l recognition site 
(GGTAGC) with thymine (ie GGTTCC) the first residue of ricin-A is unaltered (GTA/GTT = Val). 
ie: 



40 



Kpnl 53bp fragment Kpn l Ricin-A sequence Kpn l 



GGTAGC ATGGTACC j GGTAGC 

^ TGA 



Changed to: 



50 



*^P" I 53bp fragment Ricin-A sequence Kpn l 

J- I I __ 

GGTAGC ATGGTTCC | GGTACC 

I TGA 
not recognised by Kpn l- 
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The oligonucleotide synthesised to produce this change has the sequence: 

5' ATAACAACATGGTTCCCAAACAATAC 3' 

5 

where the underlined base represents the mutational change: 

We planned to clone the mutated ricin-A fragment into a range of trp expression vectors for comparative 
expression studies. Cloning into plCI0020 provides a comparison with pICI 1102 to determine the effects on 
expression, if any, of the single base substitution. 

10 

9.5 b) Mutagenesis 

The template for mutagenesis was MRA16 which is the M1 3 clone containing the two Kpn l fragments pre- 
sent in pICI 11 02. After mutagenesis, isolates carrying the desired mutations were identified by random sam- 
15 pling and DNA sequence detemnlnation over the region to which the mutagenic oligonucleotide binds 
specifically. 

One mutated template was named MRA22. This was analysed further by DNA sequence determination of 
the entire ricin-A coding sequence to verify the absence of non-specific mutations. 

20 9.5 c) Sub-cloning 

The mutated, single-stranded DMAs were used to transform competent Bcoli TG1 cells to produce single 
plaques. Individual plaques were then picked and replicativeform (RF, double-stranded) DNA purified by band- 
ing on caesium chloride/ethidium bromide buoyant density gradients. The purified RF DNA was digested to 
25 completion with Kpn l. Cloning was achieved by "shotgun" ligation of the digested RF DNA with the appropriate 
KenI cut and phosphatased expression vector or by specific ligation of the ricin-A fragment after its purification 
from an agarose gel. Ligated DNA was transfomned into E.coii TG1 or HB101. 

Ricin-A containing clones were identified by hybridisation screening using a ^^P labelled ricin-A probe pro- 
duced by random hexanucleotide priming of a Kpn l fragment isolated from another ricin A containing clone (pICI 
30 1121). Colonies showing positive hybridisation were screened further by restriction analysis of plasmid DNA 
using a Kj^l single digest and an EcoRI/Bglll double digest. Kpn l identifies the size of the inserted fragment 
and Eco RI/BqIII determines the orientation of the fragment 

Clones confirmed as having the ricin-A fragment in the correct orientation for expression were subjected 
to clone selection grows and analysis by SDS-PAGE followed by Coomassie staining and Western blotting of 
35 duplicate gels. The level of ricin A accumulation in these clones was equivalent to that detected from pIC1 1 1 02. 

One isolate was selected and named plCI1131. 

9.5 d) Use of an alternative transcription terminator element , 

40 In these experiments, the t[2 promoter and ricin-A fragment from pICI 1131 was excised by digestion with 

the enzymes EcoRI and Sai l. The latter enzyme cleaves between the 3' terminus of the ricin-A coding sequence 
and the tr^A transcription terminator. The resulting fragment was excised from an agarose gel (2% NuSieve 
GTG Agarose, FMC Bioproducts) and purified by phenol and chloroform extractions followed by ethanol pre- 
cipitation. The purified fragment was ligated with pIC1 1 079 cut with EcoRI and Sail. This latter plasmid contains 

45 the T4 terminator between unique Sail and Sph I sites. 

Ligated DNA was used to transform competent E.coii HB101 (BRL) and hybridisation screening used to 
detect the presence of ricin-A DNA as in previous experiments. Positively hybridising clones were chosen for 
plasmid DNA preparation followed by restriction analysis with EcoRI and Sail together to show the presence 
of an appropriately sized fragment 

50 One isolate with the conrect construction was identified and named pICI1 185. 

9.5 e) Generation inducible tetracycline selection vector 

The clone, pICH 185 . was used to produce a further construct by subcloning the expression cassette into 
55 a further vector. pICI 0042. Plasmid DNA was prepared from plCI1185 and digested with EcoRI and Sph I 
together to excise an expression cassette containing the tr^ promoter/RBSI/ricin-A (MRA22) fragment/T4 ter- 
minator. This fragment was isolated by the method outlined in 9.4d and ligated with pICI 0042 cut with EcoRI 
and Sph I. 
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Ligated DNA was used to transform E.coli HB101. HB101 transformations were plated on L agar + tetracyc- 
line, and incubated at SZ'C overnight and colonies screened by hybridisation with a labelled ricin-A DNA 
probe. 

In both cases, positively identified colonies were confirmed by restriction analysis of plasmid DNA using 
5 EcoRI/Sph I and EcoRI/Bql ll digests. Three isolates were identified ie pICH 187.1-3. 
Fig.1 9 outlines the construction of plCI1185 and plCI1187. 

9.5 f) Clone selection 

10 The clones isolated were used to transfonm E.coli 71.18 and single colonies picked for clone selection 

studies. The resulting whole cell lysates were electrophoresed on duplicate SDS-PAGE gels, one of which was 
stained with Coomassie blue and the other used for Western blot analysis. 

The stained gel provided minimal data on ricin-A expression due to the presence of a co-migrating protein 
from E.coli 71.18. Western blotting cleariy indicated ricin-A expression in comparison to positive and negative 

15 control samples. One Isolate was used in the fermentations described below. 

RICIN A CHAIN FERMENTATION 

Plasmid pICI 1187 was transformed into E. coli strain MSD68 and the resultant recombinant (MSD1051) 
20 purified and maintained on glycerol stocks at -80°C. 

An aliquot of the culture was removed from stock and streaked onto agar plates of L-tetracycline to separate 
single colonies after overnight growth at 37°C. A single colony of MSD 1051 was removed and resuspended 
in 10ml L- tetracycline broth and 100^1 immediately inoculated into each of 10 250ml Erhlenmeyer flasks con- 
taining 75ml L-tetracycline broth. After growth for 1 6 at 37°C on a reciprocating shaker the contents of the flasks 
25 were pooled and used to inoculate a fermenter containing 20L modified LCM50 growth medium. 

Composition of Modified LCM50 



Made up of distilled water 



KH^PO^ 3.0 

35 Na2HP0^ 6.0 

NaCl 0.5 

Casein hydrolysate (Oxoid LAI) 2.0 

^ im^)^SO^ 10.00 

Yeast Extract (Difco) 20.00 

Glycerol 35.00 

MgSO^. 7E^0 0.5 

45 CaCl2. 2H^0 0.03 

Thiamine 0.008 

FeSO^/Citric Acid 0.94/0.02 

Trace element solution (TES) 0.5ml 



Fenmentations were then canried out at a temperature of 37**C and pH, controlled by automatic addition of 
6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air-saturation 
and was controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fermenter, initially 
55 20L/min, corresponding to 1 volume per volume per minute (WM) was increased to 45L/min when the fermenter 
stirrer speed approached 80-90% of Its maximum. 

Throughout the fermentation samples were taken for measurement of optical density (OD550). cell dry 
weight and accumulation of ricin A chain within the cells. Rictn A chain accumulation was measured by scanning 

31 
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Coomassie blue stained SDS-PAGE gels of whole cell lysates of the sampled bacteria as is well known in the 
art. 

4^2 h after inoculation, yeast extract (Difco) solution (225g/L) was pumped into the femienters at a rate of 
1.7g/L/h. 

5 After 12h when OD550 reached approximately 50, and before the fermentation became oxygen-limited bac- 

teria were harvested on a Sorval RC3B centrifuge (700g, 30 min. 4°) and accumulated protein recovered from 
the bacteria. 

NOTE: 

10 

E. coll DS41 0 (also refenred to as MSD68 herein) is well known (Dougan and Shen*att, Molecular and Gen- 
eral Genetics. Vol 151, pi 51 -160, 1977) and has the published genotype F~ ara azi ton A lac Y min A min B 
rpsLmal Axyl mtl thi. This strain is freely available to the public, and moreover was deposited by the Applicants 
on 7 June 1985, under the Budapest Treaty, with the National Collections Of Industrial & Marine Bacteria Ltd, 
IS Aberdeen, Scotland under deposition number 12100. 

The cells were collected from the fermentation broth using a continuous disc stack intermittent discharge 
separator. The broth (501 from 2 x 251 fennentation) was initially transferired from the fermenters to a 501 trundle 
tank and transported to a contained system consisting of a number of holding tanks connected to the separator 
and homogenizer. 

20 The trundle tank was connected to this system and the broth pumped through the centrifugal separator at 

a flow rate of 40l/h. The discharge rate was adjusted so that the centrifuge supernatant was clear by visual 
inspection of an eyeglass in the supernatant discharge line. The supernatant was collected in a kill tank con- 
taining .201 of 0.1 M sodium hydroxide sanitizing solution prior to disposal. The cells were resuspended in 401 
of Buffer A (50mM sodium dihydrogen orthophosphate, 25mM ethylene diamine tetra acetic acid, 5mM ben- 

25 zamidine, 2mM dithiothreitol, pH 6.3 with 5N sodium hydroxide) and prechilled to 8**C in the solids receiver ves- 
sel. The suspended cells were then transferred back to the trundle tank via the homogenizer adjusted to a 
working pressure of 600 bar. The resulting homogenate (601) was chilled to <20**C and make 0.5% with respect 
to polythenemine by the addition of 2.51 of a 10% (v/v) solution. The suspension was allowed to flocculate for 
10 min before transfer to the Holding Tank via the centrifugal separator. The clear supernatant was then steri- 

30 lized by purifying through a depth filter and a positively charged 0.2)i membrane filter. 

The sterile clarified supernatant was concentrated to a volume of 121 using aspiral cartridge cross flow fil- 
tration device and the solution brought to 40% saturation by the addition of 2.9KG of solid ammonium sulphate 
crystals. The solution was allowed to flocculate by gentle stirring overnight at 15°C and then centrifuged using 
the continuous flow centrifuge. The discharged slurry was stored at 70°C until required for further processing. 

35 The ammonium sulphate precipitate was thawed in the presence of 141 of Buffer B (50mM sodium dihyd- 

rogen orthophosphate, 25mM elthylene diamine tetracetic acid, 2mM dithiothreitol, pH 6.3 with 5N sodium hyd- 
roxide). After 30min the suspension was clarified by centrifugation and desalted by diafiltration against 701 of 
Buffer B and the conductivity checked that it had been reduced to below 3MS/cm. The desalted solution was 
claified further by centrifugation and processed immediately. 

40 The desalted solution was slowly added to a batch chromatography tank containing 2kg of DEAE-cellulose 

which had been equilibrated with 601 of Buffer B. After stirring for 6.5h the unbound r-ricin solution was pumped 
from the bottom of the tank through anil .3cm diam x 1 0cm column of packed and equilibrated DEAE-cellulose 
at a flow rate of 80ml/min. The bulk of the r-ricin A did not bind and was collected in a stainless steel vessel. 
The r-ricin A solution was adjusted to pH 5.5 with 1 M orthophosphonic acid and applied to a 1 0cm diameter 

45 X 10cm column of carbosymethyl agaros equilibrated with 101 of Buffer C (25mM sodium dihydrogen orthophos- 
phate, 5mM ethylene diamine tetra acetic acid, 2mM dithiothreitol, pH 5.5 with 5N sodium hydroxide). The r-ricin 
A bound to this column and after washing with 101 of Buffer 0 was eluted with Buffer D (25mm sodium dihyd- 
rogen orthophosphate, 5mM ethyl diamine tetracetic acid, 2mM dithiothreitol, lOOmM sodium chloride. pH 5.5 
with 5N sodium hydroxide). The pure r-ricin A eluted as a single peak which was collected and stored at 4°C 

50 as a sterile solution until required for ftjrther processing. The r-ricin A is stable under these conditions for up 
to 2 months. 
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SEQUENCE LISTING 



5 The following is a list of the sequences referred to in the 

application. The sequences are written in the conventional 5' to 
3 'sense. 

10 

SEQ ID No 1 

SEQUENCE LENGTH: 62 bases 
SEQUENCE TYPE: Nucleotide 
15 STRANDEDNESS: Single 
TOPOLOGY: Linear 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC 47 
CTG CTG AAG TGT CTC 62 

20 

SEQ ID No 2 

SEQUENCE LENGTH: 64 bases 
25 SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 

TOPOLOGY: Linear : 

CTG TTC GAG AC A CTT CAG CAG GAA AG A CTG CGG CAG AGA GCT TGC 45 

30 

TGG ACC CAG TGG AGT ACTG 64 



SEQ ID No 3 
35 SEQUENCE LENGTH: 60 bases 

SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

40 

GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT CTG CAG GAA 45 
AAG CTG TGC GCA ACC 60 



50 



55 
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SEQ ID No A 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

TTT GTA GGT TGC GCA CAG CTT TTC CTG CAG AGC CGC ACC ATC GCC 45 
TTG AAT TTT ACG TAC 60 

SEQ ID No 5 

SEQUENCE LENGTH: 48 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

TAC AAA CTG TGC CAC CCT GAG GAA CTG GTG CTG CTC GGT CAC TCT CTG 48 
SEQ ID No 6 

SEQUENCE LENGTH: 51 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS : Single 

TOPOLOGY: Linear 

CGG GAT CCC CAG AGA GTG ACC GAG CAG CAC CAG TTC CTC AGG GTG 45 
GCA CAG 51 

SEQ ID No 7 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GGG ATC CCG TGG GCT CCA CTG AGC TCT TGC CCG TCC CAA GCT TTA 45 
CAA CTG GCA GGC TGC TTG 63 



45 



50 



55 
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SEQ ID No 8 

SEQUENCE LENGTH: 60 bases 
5 SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CTG GCT CAA GCA GCC TGC GAG TTG TAA AGC TTG GGA CGG GCA AGA 

10 

GCT CAG TGG AGC CCA 

SEQ ID No 9 
1S SEQUENCE LENGTH: 63 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

20 

AGC CAG CTG CAC TCC GGT CTG TTC CTG TAG CAG GGT CTG CTG CAG 

GCT CTA GAA GGC ATC TCT 

25 SEQ ID No 10 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 

30 

TOPOLOGY: Linear 

TTC AGG AGA GAT GCC TTC TAG AGC CTG CAG CAG ACC CTG GTA CAG 
GAA CAG ACC GGA GTG CAG 

35 

SEQ ID No 11 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 

40 

STRANDEDNESS: Single 
TOPOLOGY: Linear 

CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT GCC 
^ GAC TTC GCT ACT ACC 



50 



55 
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SEQ ID No 12 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TTG CCA TAT GGT AGT AGC GAA GTC GGC AAC GTC CAG CTG CAG TGT 45 
GTC CAG GGT GGG CCC CAA ^3 



SEQ ID No 13 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 
ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG GCT CCG GCA CTG CAG 45 
CCG ACT CAG GGT GCG ATG ^3 

25 SEQ ID No 14 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STEIANDEDNESS: Single 
TOPOLOGY: Linear 

TGC TGG CAT CGC ACC CTG AGT CGG CTG CAG TGC CGG AGC CAT ACC 45 
CAG TTC CTC CAT CTG 



60 



SEQ ID No 15 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CCA GCA TTC GCC TCT GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG 45 
GTT GCC TCC CAT CTT 



50 



55 
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SEQ ID No 16 

SEQUENCE LENGTH: 60 bases 
^ SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GCT CTG AAG ATG GGA GGC AAC CAG AAC ACC GCC TGC GCG CCG CTG A5 
GAA AGC AGA GGC GAA 50 

SEQ ID No 17 

SEQUENCE LENGTH: 55 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CAG AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC 45 
CAG CCG TTAG 55 



15 



20 



25 



SEQ ID No 18 
SEQUENCE LENGTH: 53 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TCGACTTA CGG CTG GGC CAG GTG ACG CAG AAC GCG GTA AGA CAC CTC ^7 
GAG GAA 33 



35 



40 



45 



50 



55 



SEQ ID No 19 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TACAACTGGCAGGCTGCTTGA 21 
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20 



SEQ ID No 20 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 
GACGTTGCCGACTTCGCTACT 



21 



21 



21 



SEQ ID No 21 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : S i ngle 
TOPOLOGY: Linear 
TGCCGGAGCCATACCCAGTTC 
SEQ ID No 22 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
25 STRANDEDNESS: Single 
TOPOLOGY: Linear 
GCCTGCCAGTTGTAAAGCTTG 

SEQ ID No 23 

SEQUENCE LENGTH: 26 bases 
SEQUENCE TYPE: Nucleotide 
35 STRANDEDNESS: Single 
TOPOLOGY: Linear 
GCACCATCGCCTTGAATTTTACGTAG 

40 

SEQ ID No 2A 

SEQUENCE LENGTH: 62 bases 

SEQUENCE TYPE: Nucleotide 

^ STRANDEDNESS : Si ngle 

TOPOLOGY: Linear 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC 47 
CTG CTG AAG TCT CTC 



26 



50 



62 



55 
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SEQ ID No 25 

SEQUENCE LENGTH: 64 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNE S S : S i ng 1 e 
TOPOLOGY: Linear 

CTG TTC .GAG AGA CTT CAG GAG GAA AGA CTG CGG CAG AGA GCT TGC 45 
TGG ACC CAG TGG AGT ACTG 

SEQ ID No 26 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT CTG CAG GAA 45 
AAG CTG TGC GCA ACC 



SEQ ID No 27 
SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
^ TOPOLOGY: Linear 

TTT GTA GGT TGC GCA CAG CTT TTC CTG CAG AGC CGC ACC GCT GCC 45 
TTG AAT TTT ACG TAC 

SEQ ID No 28 

SEQUENCE LENGTH: 29 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

CTT CAG CAG GAA AGA ACG CGG CAG AGA GC 29 

45 



50 



55 



39 



BNSDOCID: <EP 0502637A2_I_> 



EP 0 502 637 A2 



10 



15 



SEQ ID No 29 

SEQUENCE LENGTH: 33 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GC TTG GGA AGA GCA AGA GCT CAG AGA AGC CCA C 
SEQ ID No 30 

SEQUENCE LENGTH: 168 + 166 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Double 
TOPOLOGY: Linear 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 50 
GACCGT TTATAAGACT TTACTCGACA ACTGTTAATT AGTAGCTTGA 46 

AGTTAACTAG TACGCAAGTT CACGTAAAAA GGGTATCGAC 90 
TCAATTGATC ATGCGTTCAA GTGCATTTTT CCCATAGCTG 86 

so AATGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT GCAAGCTTAG 140 
TTACCATGGG CCCCTAGGAG ATCTCAGCTG GACGTCCGTA CGTTCGAATC 136 



20 



35 



45 



32 



CCCGCCTAAT GAGCGGGCTT TTTTTTAT 
GGGCGGATTA CTCGCCCGAA AAAAAATAGC 



168 
166 



SEQ ID No 31 

40 SEQUENCE LENGTH: 534 bases 

SEQUENCE TYPE: Nucleotide vith corresponding protein 

STRANDEDNESS: Single 

TOPOLOGY: Linear 



AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG 50 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu 
50 1 5 10 
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CTG AAG TCT CTC GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT 
Leu Lys Ser Leu Glu Gin Val Arg Lys He Gin Gly Ser Gly Ala Ala 
15 20 25 30 



98 



10 



CTG CAG GAA AAG CTG TGC GCA ACC TAG AAA CTG TGC CAC CCT GAG GAA 
Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu 
35 40 A5 



146 



15 



CTG GTG CTG CTC GGT CAC TCT CTG GGG ATC CCG TGG GCT CCA CTG AGC 
Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser 
50 55 60 



19A 



20 



TCT TGC CCG TCC CAA GCT TTA CAA CTG GCA GGC TGC TTG AGC CAG CTG 
Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 
65 70 75 



242 



25 



30 



35 



40 



CkC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG GCT CTA GAA GGC 290 
His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly 
80 85 90 

ATC TCT CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT 338 
He Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val 
95 100 105 110 

■ ■ 

GCC GAC TTC GCT ACT ACC ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG 386 
Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met 
115 120 125 



45 



GCT CCG GCA CTG CAG CCG ACT CAG GGT GCG ATG CCA GCA TTC GCC TCT 
Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser 
130 135 140 



434 



50 



GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG GTT GCC TCC CAT CTT CAG 
Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin 
145 145 155 



482 



55 
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AGC TTC CTC GAG GTG TCT TAG CGC GTT CTG CGT CAC CTG GCC GAG CCG 
Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
160 165 170 174 



530 



10 



15 



TAA G 

SEQ ID No 32 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



534 



81 bases 
Nucleotide 
Single 
Linear 



20 



GAATTCAACA AAACGGTTGA CAACATGAAG TAAACACGGT ACGATGTACC 50 
ACAAGTTCAC GTAAAAAGGG TATCGACAATG 81 



25 



30 



35 



40 



45 



SEQ ID No 33 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



67 67 bases 
Nucleotide 
Double 
Linear 



TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60 
GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 56 



CATGCGA 
GTACGCTTCGA 

SEQ ID No 34 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



67 
67 



118 bases 
Nucleotide 
Single 
Linear 



50 



AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 
AGTTAACTAG TACGCAGAGC TCAATCTAGA GGGTATTAAT AATGTTCCCA 
TTGGAGGATG ATTAAATG 



50 
100 
118 
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SEQ ID No 35 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS : 
TOPOLOGY: 



35 + 35 bases 
Nucleotide 
Double 
Linear 



10 



AGCTCCATAT GGTACCAGAT CTCTCGAGAG TACTT 

GGTATA CCATGGTCTA GAGAGCTCTC ATGAAGATC 



35 
35 



15 



20 



SEQ ID No 36 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



23 + 15 bases 
Nucleotide 
Double 
Linear 



25 AGCTCAGCTG CAGCATATGG TAG 
GTCGAC GTCGTATAC 



23 
15 



30 



35 



SEQ ID No 37 
SEQUENCE LENGTH: 
SEQUENCE TYPE: 
STRANDEDNESS: 
TOPOLOGY: 



72 + 72 bases 
Nucleotide 
Double 
Linear 



40 



TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60 
GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 56 



45 



CATGCGGATC CC 
GTACGCCTAG GGGAAC 



72 
72 



50 
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SEQ ID No 38 

SEQUENCE LENGTH: 84 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

AAT TCA ACA AAA CGG TTG ACA ACA TGA AGT AAA CAC GGT ACQ ATG 45 
TAC CAC AAG TTC ACG TAA AAA GGG TAT CGA CAA TGG TAG 84 



SEQ ID No 39 

SEQUENCE LENGTH: 76 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CAT TGT CGA TAC CCT TTT TAC GTG AAC TTG TGG TAC ATC GTA CCG 45 
TGT TTA CTT CAT GTT GTC AAC GGT TTT GTT G 76 



30 

SEQ ID No 40 

SEQUENCE LENGTH: 24 + 24 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Double 

TOPOLOGY: Linear 



20 



25 



AATTCGCATG CGGATCCATC GATC 24 
GCGTAC GCCTAGGTAG CTAGAGCC 24 

SEQ ID No 41 

SEQUENCE LENGTH: 174/177 Amino acids 

SEQUENCE TYPE: Amino acid 

TOPOLOGY: Linear 
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Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin 

15 10 

Ser Phe Leu Leu Lys Cys Leu Glu Gin Val Arg 

15 20 



10 



Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu 
25 30 



IS 



Lys Leu (Val Ser Glu)^ Cys Ala Thr Tyr Lys Leu 



35 



40 



20 



Cys His Pro Glu Glu Leu Val Leu Leu Gly His 
45 50 



25 



Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser 
55 60 



30 



Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys 
65 70 



35 



40 



45 



Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 
75 80 85 

Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 

90 95 

Pro . Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin 
100 105 

Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
110 115 



50 



Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
120 125 



55 
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Leu Gin Pro Thr Gin 
130 

Ala Ser Ala Phe Gin 

145 

Leu Val Ala Ser His 
155 

Val Ser Tyr Arg Val 
165 

Pro 

(where m is 0 or 1), 



Gly Ala Met Pro Ala Phe 
135 

Arg Arg Ala Gly Gly Val 

150 

Leu Gin Ser Phe Leu Glu 
160 

Leu Arg His Leu Ala Gin 
170 



SEQ ID NO 42 
T7A3 PROMOTER SEQUENCE 
SEQUENCE LENGTH: 46 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: single 
TOPOLOGY: linear 

AACAAAACGG TTGACAACAT GAAGTAAACA CGGTACGATG TACCAC 



SEQ ID NO 43 

lacO SEQUENCE 

SEQUENCE LENGTH: 22 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: single 

TOPOLOGY: linear 

AATTGTGAGC GGATAACAAT TT 



46 



EP 0 502 637 A2 



SEQ ID NO 44 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

GATAACAACA TATTCCCCAA A 



SEQ ID NO 45 

SEQUENCE LENGTH: 21 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

GATAACAACA TGGTACCCAA A 



SEQ ID NO 46 

SEQUENCE LENGTH: 21 bases 

30 

SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

35 

AACAACATGG TACCCAAACA A 



SEQ ID NO 47 

SEQUENCE LENGTH: 86 bases 
SEQUENCE TYPE: Nucleotide 
STRANDNESS: single 
TOPOLOGY: linear 

AAAAAGGGTA TCGACATGGT ACCCGGGGAT CCACCTCAGG GTGGTCTTTC 
ACATTAGAGG ATAACAACAT GGTACCCAAA CAATAC 



47 
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Claims 

1. A vector which comprises an inducible selection gene, and a sequence which codes for a heterologous 
polypeptide. 

2. A vector as claimed in claim 1 wherein the selection gene comprises the tetA and tetR genes. 

3. A vector as claimed in claims 1 or 2 which includes a DNA sequence which is capable of conferring stability 
on the vector. 

4. A vector as claimed in claim 3 wherein the DNA sequence capable of conferring stability on the vector 
comprises the cer sequence. 

5. A vector as claimed in any one of the preceding claims which includes a transcription terminator as found 
at the terminus of gene 32 of bacteriophage T4. 

6. A vector as claimed in any one of the preceding claims which includes a promoter selected from the trp 
promoter and the T7A3 promoter. 

7. A vector which comprises a replicable plasmidic expression vehicle comprising a promoter, the cer sequ- 
ence, a transcription terminator as found at the tenminus of gene 32 of bacteriophage T4. an origin of rep- 
lication and a DNA sequence which codes for a heterologous polypeptide. 

8. A vector as claimed in any one of the preceding claims wherein the heterologous polypeptide is selected 
from ricin A and G-CSF or an analogue thereof. 

9. A host capable of expressing a heterologous polypeptide, which host comprises a vector as claimed in 
any one of the preceding claims. 

10. A process for preparing a host as defined in claim 9. said process comprising transforming a host by inser- 
tion therein of a vector as defined in any one of claims 1 to 8. 

11- A process for preparing a polypeptide, said process comprising culturing a host as defined in claim 9 so 
that polypeptide is expressed. 

12. A replicable plasmidic expression vehicle which comprises an inducible selection gene comprising the tetA 
and tetR genes, and a DNA sequence which codes for a heterologous polypeptide. 

13. A host which comprises a replicable plasmidic expression vehicle as defined in claim 12. 
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TRANSCRIPTION TERMINATION SEQUENCE 

Fig. 1(a) 

Sail 

5 ' TCGACATT ATATT ACTAATTAATTGGGG ACCCT AGAGGTCCCCTTTTTTATTTTAA 
3 ' GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT 



Sghl Hindlll 
AAAGCATGCA 3 ' 

TTTCGTACGTTCGA 5 ' 



Fig. Kb) 

Sail 

5 ' TCGACATTATATTACTAATTAATTGGGGACCCTAGAGGTCCCCTTTTTTATTTTAA 
3 ' GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT 



SphI BamHI Styl 
AAAGCATGCGGATCCC 3' 
TTTCGTACGCCTAGGGGAAC 5' 
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Fig. U. 
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Fig.5(a) 

EcoRl 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 
GACCGT TTATAAGACT TTACTCGACA ACTGTTAATT AGTAGCTTGA 

Hpal 

AGTTAACTAG TACGCAAGTT CACGTAAAAA GGGTATCGAC 
TCAATTGATC ATGCGTTCAA GTGCATTTTT CCCATAGCTG 

Kpnl BamHI Xbal Sail PstI SphI 
AATGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT GC^^GCTTAG 
TTACCATGGG CCCCTAGGAG ATCTCAGCTG GACGTCCGTA CGTTCG^J^TC 

Clal 

CCCGCCTAAT GAGCGGGCTT TTTTTTAT 
GGGCGGATTA CTCGCCCGAA AAAAAATAGC 
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Fig. 5(b) 



EcoRI 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAJk TCATCGAACT 
Hpal 

AGTTAACTAG TACGCAGAGC TCAATCTAGA GGGTATTA.AT AATGTTCCCA 
TTGGAGGATG ATTAATG 
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Fig. 7. 
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Fig. 8. 
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I — • — • 1 



EcoRI Ptrp Hpal RBS Kml 
I 1 1 




ATCGGGCGGATTACTCGCCCGAAAAATAGCTA . .TACGA 



Fig. 12. 
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Hgal RBS { IIP LEADER) Kgnl 

AACTAGTACGCAAGTTCACGTAAAAAGGGTATCGACAATGGTAC 
TTGATCATGCGTTCAAGTGCATTTTTCCCATAGCTCTTACC 



EcoRI 



ISOLATE EcoRI/Clal 
FRAGMENT 



LIGATE 




EcoRI / 
AccICUT 



ISOLATE LARGE 
EcoRI /Act I FRAGMENT 



EcoRI 

H PfTp /RBS 



Hp_al X^^pnl 

Xbal> Sail 
Psj-I % SpJiJ 

Hindlll 

PICI0020 ^ 




trQ ATTENUATOR 
Clal/AccI 



AccI 




CONSTRUCTION OF piCl0020 Fig.12 COIlt 
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Fig. 15. 
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Fig. 18. 

PARTIAL SEQUENCE OF DICX11Q2 



Trp Promoter 

TTCTGGCAAATATTCTGAAATGAGCTGTTGACAATTAATCATCGAACTAGTTAACTAGTA 
1 — , + + ^ ^ ^ 



S-D rbs Met Kpnl BamHI 

CGCAAGTTCACGTAAAAAGGGTATCGACAAT GGTACC CGG GGATCCA CCTCAGGGTGGTC 
61 — + + + ^ ^ ^ 

Stop Kpnl Ricin A- > 

TTTCACATTAGAGGATAACAACATGGTACCCAAACAATACCCAATTATAAACTTTACCAC 

121 + + + + + ^ 



MVPKQYPI INFTT 



AGCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGGTCGTTTAAC 
+ + ^ ^ ^ ^ 

AGATVQS YTNFIRAVRGRLT 



AACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAAACAGAGTTGGTTTGCCTAT 
+ + + ^ ^ ^ 

TGADVRHEIPVLPNRVGLPI 



AAACCAACGGTTTATTTTAGTTGAACTCTCAAATCATGCAGAGCTTTCTGTTACATTAGC 
+ + + + . ^ 

NQRFILVELSNHAELSVTLA 



CCTGGATGTCACCAATGCATATGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTT 
+ + ^ ^ ^ ^ 

LDVTNAYVVGYRAGNS AYFF 



TCATCCTGACAATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAA 
421 + + + + + 

HPDNQEDAEAITHLFTDVQN 



TCGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGCTGGTAATCT 
+ + + + ^ ^ 

RYTFAFGGNYDRLEQLAGNL 



GAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGGCTATCTCAGCGCTTTATTA 
+ + + ^ ^ 

RENIELGNGPLEEAISALYY 



TTACAGTACTGGTGGCACTCAGCTTCCAACTCTGGCTCGTTCCTTTATAATTTGCATCCA 

601 -- + + + + + + 660 

YSTGGTQLPTLAR5FIICIQ 
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FigMcont) 



AATGATTTCAGAAGCAGCAAGATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAG 
661 + + + + . + ^ 

MISEAARFQYIEGEMRTRIR 



GTACAACCGGAGATC TGCACC AGATC C TAGCG TAATTACACTTG AGAATAG T TGGGGGAG 
+ + + + ^ ^ 

YNRRSAPDPSVITLENSWGR 



ACTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAATTCAACTGCA 
781 + _..+ + + + ^ 

LSTAIQESNQGAFASPIQLQ 
AAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTATATTAATCCCTATCATAGC 

841 + + + + _ _^ ^ gQQ 

RRNG SKF SVYD.VS ILI P I lA 



TCTCATGGTGTATAGATGCGCACCTCCACCATCGTCACAGTTTTGATTGCTTATAAGGCC 
901 + + _ + + ^ 

LMVYRCAPPPSSQF* 

Kpnl Xbal PstI _S2hI_HindIII 
AGTGGTACCCGGGGATCCTCTAGAGTCGAC CTGCAGG CATGC AAGCTTA GCCCGCCTAAT 
961 + + + + + +j^020 

Terminator 

GAGCGGGCTTTTTTTTATCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCTCCTTC 
1021 + + + + ^ + 1080 

CGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGCAA 
1081 + + + + + +11^Q 
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